US007220852B1 


d2) United States Patent 

Rota et al. 


(io) Patent No.: US 7,220,852 Bl 

(45) Date of Patent: May 22, 2007 


(54) CORONAVIRUS ISOLATED FROM HUMANS 

(75) Inventars: Paul A. Rota, Decatur, GA (US); 

Larry J. Anderson, Atlanta, GA (US); 
William J. Bellini, Lilbum, GA (US); 
Cara Carthel Burns, Avondale Estates, 
GA (US); Raymond Campagnoli, 
Decatur, GA (US); Qi Chen, Marietta, 
GA (US); James A. Corner, Decatur, 
GA (US); Shannon L. Emery, Lusaka 
(ZM); Dean D. Erdman, Decatur, GA 
(US); Cynthia S. Goldsmith, Lilburn, 
GA (US); Charles D. Humphrey, 
Lilbum, GA (US); Joseph P. Icenogle, 
Atlanta, GA (US); Thomas G. Ksiazek, 
Lilbum, GA (US); Stephan S. Monroe, 
Decatur, GA (US); William Allan Nix, 
Bethlehem, GA (US); M. Steven 
Oberste, Lilburn, GA (US); Teresa C. 

T. Peret, Atlanta, GA (US); Pierre E. 
Rollin, Lilburn, GA (US); Mark A. 
Pallansch, Lilbum, GA (US); Anthony 
Sanchez, Lilbum, GA (US); Suxiang 
Tong, Alpharetta, GA (US); Sherif R. 
Zaki, Atlanta, GA (US) 

(73) Assignee: The United States of America as 

represented by the Secretary of the 
Department of Health and Human 
Services, Centers for Disease Control 
and Prevention, Washington, DC (US) 

( * ) Notice: Subject to any disclaimer, the term of this 

patent is extended or adjusted under 35 

U. S.C. 154(b) by 0 days. 

(21) Appl. No.: 10/822,904 

(22) Filed: Apr. 12, 2004 

Related U.S. Application Data 

(60) Provisional application No. 60/465,927, filed on Apr. 
25, 2003. 

(51) Int. Ci. 

C12N15/50 (2006.01) 

C12N 7/00 (2006.01) 

(52) U.S. Ci. 536/23.72; 435/235.1 

(58) Field of Classification Search . 536/23.72; 

514/44; 435/235.1,320.1 
See application file for complete search history. 

(56) References Cited 

U.S. PATENT DOCUMENTS 

2005/0181357 Al* 8/2005 Peiris et al. 435/5 


FOREIGN PATENT DOCUMENTS 

WO WO 2004/085633 * 10/2004 

WO WO 2004/092360 * 10/2004 

OTHER PUBLICATIONS 

GenBank Accession No. AY274119, “SARS Coronavirus Tor2, 
complete genome,” Version AY274119.1, Apr. 14, 2003.* 
GenBank Accession No. AY278487, “SARS Coronavirus BJ02, 
partial genome,” Version AY278487.1, Apr. 21, 2003.* 

Genbank Accession No. AY278554, “SARS Coronavirus CUHK- 
Wl, complete genome.,” Version AY278554.1, Apr. 18, 2003.* 
GenBank Accession No. AY278491, “SARS Coronavirus HKU- 
39849, complete genome,” Version AY278491.1, Apr. 18, 2003.* 
SARS-associated Coronavims. Genomic Sequence Availability. 
[online] [retreived on Aug. 8, 2005]. Retreived from the Internet 
<URL: http://www.bcgsc.ca/bioinfo/SARS>.* 

GenBank Accession No. AY274119, Apr. 14, 2003. 

GenBank Accession No. AY278741, Apr. 21, 2003. 

“Update: Outbreak of Severe Acute Respiratory 

Syndrome—Worldwide, 2003 MMWR Weekly 52:241-248 (2003). 
Emery et al., “Real-Time Reverse Transcription-Polymerase Chain 
Reaction Assay for SARS-Associated Coronavims,” Emerg. Infect. 
Diseases 10: 311-316 (2004). 

Goldsmith et al., “Ultrastructural Characterization of SARS 
Coronavirus,” Emerg. Infect. Diseases 10: 320-326 (2004). 

Ksiazek et al., “A Novel Coronavims Associated with Severe Acute 
Respiratory Syndrome,” N. Eng. J. Med. 348: 1953-1966 (2003). 
Luo and Luo, “Initial SARS Coronavims Genome Sequence Analy¬ 
sis Using a Bioinformatics Platform,” 2" d Asia-Pacific Bioinformat- 
ics Conference ( APBC2004 ), Dunedin, New Zealand (2004). 
Marra et al., "The Genome Sequence of the SARS-Associated 
Coronavims,” Science 300: 1393-1404 (2003). 

Supplementary Online Material for Marra et al. «www. 
sciencemag.org/cgi/content/full/1085953/DCl». 

Rota et al., “Characterization of a Novel Coronavims Associated 
with Severe Acute Respiratory Syndrome,” Science 300: 1394-1399 
(2003). 

Supplementary Online Material for Rota et al. «www.sciencemag. 
org/cgi/content/full/1085952/DCl». 

* cited by examiner 

Primary Examiner —Mary E. Mosher 

(74) Attorney, Agent, or Firm — Klarquist Sparkman LLP 

(57) ABSTRACT 

Disclosed herein is a newly isolated human coronavims 
(SARS-CoV), the causative agent of severe acute respiratory 
syndrome (SARS). Also provided are the nucleic acid 
sequence of the SARS-CoV genome and the amino acid 
sequences of the SARS-CoV open reading frames, as well as 
methods of using these molecules to detect a SARS-CoV 
and detect infections therewith. Immune stimulatory com- 
positions are also provided, along with methods of their use. 
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CORONAVIRUS ISOLATED FROM HUMANS 

PRIORITY CLAIM 

This application Claims the benefit of U.S. Provisional 
Patent Application No. 60/465,927 filed Apr. 25, 2003, 
which is incorporated herein by reference in its entirety. 

STATEMENT OF GOVERNMENT SUPPORT 

This invention was made by the Centers for Disease 
Control and Prevention, an agency of the United States 
Government. Therefore, the U.S. Government has certain 
rights in this invention. 

FIELD OF THE DISCLOSURE 

This invention relates to a newly isolated human Coro¬ 
navirus. More particularly, it relates to an isolated Coronavi¬ 
rus genome, isolated Coronavirus proteins, and isolated 
nucleic acid molecules encoding the same. The disclosure 
further relates to methods of detecting a severe acute respi¬ 
ratory syndrome-associated Coronavirus and compositions 
comprising immunogenic Coronavirus compounds. 

BACKGROUND 

The coronaviruses (order Nidovirales, family Coronaviri- 
dae, genus Coronavirus) are a diverse group of large, envel- 
oped, positive-stranded RNA viruses that cause respiratory 
and enteric diseases in humans and other animals. At 
approximately 30,000 nucleotides (nt), their genome is the 
largest found in any of the RNA viruses. Coronaviruses are 
spherical, 100-160 nm in diameter with 20-40 nm complex 
club shaped surface projections surrounding the periphery. 
Coronaviruses share common structural proteins including a 
spike protein (S), membrane protein (M), envelope protein 
(E), and, in a subset of coronaviruses, a hemagglutinin- 
esterase protein (HE). The S protein, a glycoprotein which 
protrades from the virus membrane, is involved in host cell 
receptor binding and is a target for neutralizing antibodies. 
The E and M proteins are involved in virion fonnation and 
release from the host cell. Coronavirus particles are found 
within the cisternae of the rough endoplasmic reticulum and 
in vesicles of infected host cells where virions are 
assembled. The Coronavirus genome consists of two open 
reading frames (ORFla and ORFlb) yielding an RNA 
polymerase and a nested set of subgenomic mRNAs encod¬ 
ing structural and nonstructural proteins, including the S, E, 
M, and nucleocapsid (N) proteins. The genus Coronavirus 
includes at least 13 species which have been subdivided into 
at least tliree groups (groups I, II, and III) on the basis of 
serological and genetic properties (deVries et al., Sem. Virol. 
8:33-47, 1997; Fields et al. eds. Fields Virology, 3rd edition, 
Raven Press, Philadelphia, 1323-1341, 1996; Mahey and 
Collier eds. Microbiology and Microbial Infections , Volume 
1 Virology, 9 ,h edition, Oxford University Press, 463-479, 
1998). 

The tliree known groups of Coronavirus are associated 
with a variety of diseases of humans and domestic animals 
(for example, cattle, pigs, cats, dogs, rodents, and birds), 
including gastroenteritis and upper and lower respiratory 
tract disease. Known coronaviruses include human Coro¬ 
navirus 229E (HCoV-229E), canine Coronavirus (CCoV), 
feline infectious peritonitis virus (FIPV), porcine transmis- 
sible gastroenteritis virus (TGEV), porcine epidemic diar- 
rhea virus (PEDV), human Coronavirus OC43 (HcoV- 
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OC43), bovine Coronavirus (BCoV), porcine 
hemagglutinating encephalomyelitis virus (HEV), rat sialo- 
dacryoadenitis virus (SDAV), mouse hepatitis virus (MHV), 
mrkey Coronavirus (TCoV), and avian infectious bronchitis 
5 virus (IBV-Avian) (Fields et al. eds. Fields Virology, 3rd 
edition, Raven Press, Philadelphia, 1323-1341, 1996; 
Mahey and Collier eds. Microbiology and Microbial Infec¬ 
tions, Volume 1 Virology, 9 rh edition, Oxford University 
Press, 463-479, 1998). 

1° Coronavirus infections are generally host specific with 
respect to infectivity and clinical Symptoms. Coronaviruses 
further exhibit marked tissue tropism; infection in the incor- 
rect host species or tissue type may result in an abortive 
infection, mutant virus production and altered virulence. 
15 Coronaviruses generally do not grow well in cell culture, 
and animal models for human Coronavirus infection are 
lacking. Therefore, little is known about them (Fields et al. 
eds. Fields Virology, 3rd edition, Raven Press, Philadelphia, 
1323-1341, 1996). The known human coronaviruses are 
20 notably fastidious in cell culture, preferring select cell lines, 
organ culture, or suckling mice for propagation. Coronavi¬ 
ruses grown in cell culture exhibit varying degrees of 
virulence and/or cytopathic effect (CPE) depending on the 
host cell type and culture conditions. The only human or 
25 animal Coronavirus which has been shown to grow in Vero 
E6 cells is PEDV, and it requires the addition of trypsin to 
culture medium for growth in Vero E6 cells. Moreover, 
PEDV adapted to Vero E6 cell culture results in a strikingly 
different CPE, with cytoplasmic vacuoles and the formation 
30 of large syncytia (Hofmann and Wyler, J. Clin. Micro. 
26:2235-39, 1988; Kusanagi et el., J. Vet. Med. Sei. 554: 
313-18, 1991). 

Coronavirus have not previously been known to cause 
severe disease in humans, but have been identified as a 
35 major cause of upper respiratory tract illness, including the 
common cold. Repeat infections in humans are common 
within and across serotype, suggesting that immune 
response to Coronavirus infection in humans is either incom- 
plete or short lived. Coronavirus infection in animals can 
40 cause severe enteric or respiratory disease. Vaccination has 
been used successftilly to prevent and control some Coro¬ 
navirus infections in animals. The ability of animal-specific 
coronavimses to cause severe disease raises the possibility 
that Coronavirus could also cause more severe disease in 
4; ' humans (Fields et al. eds. Fields Virology, 3rd edition, Raven 
Press, Philadelphia, 1323-1341, 1996; Mahey and Collier 
eds. Microbiology and Microbial Infections, Volume 1 Virol¬ 
ogy, 9 ,h edition, Oxford University Press, 463-479, 1998). 

In late 2002, cases of life-threatening respiratory disease 
with no identifiable etiology were reported from Guangdong 
Province, China, followed by reports from Vietnam, Canada, 
and Hong Kong of severe febrile respiratory illness that 
spread to household members and health care workers. The 
syndrome was designated “severe acute respiratory Syn¬ 
drome” (SARS) in Febraary 2003 by the Centers for Disease 
Control and Prevention ( MMWR. 52:241-48, 2003). 

Past efforts to develop rapid diagnostics and vaccines for 
Coronavirus infection in humans have been hampered by a 
60 lack of appropriate research models and the moderate course 
of disease in humans. Therefore, a need for rapid diagnostic 
tests and vaccines exists. 

SUMMARY OF THE DISCLOSURE 

65 

A newly isolated human Coronavirus has been identified 
as the causative agent of SARS, and is tenned SARS-CoV. 
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The nucleic acid sequence of the SARS-CoV genome and 
the amino acid sequences of the SARS-CoV open reading 
frames are provided herein. 

Ulis disclosure provides methods and compositions use- 
ful in detecting the presence of a SARS-CoV nucleic acid in 5 
a sample and/or diagnosing a SARS-CoV infection in a 
subject. Also provided are methods and compositions usefi.il 
in detecting the presence of a SARS-CoV antigen or anti¬ 
body in a sample and/or diagnosing a SARS-CoV infection 
in a subject. 10 

The foregoing and other features and advantages will 
become more apparent front the following detailed descrip- 
tion of several embodiments, wliich proceeds with reference 
to the accompanying figures. 

BRIEF DESCRIPTION OF THE FIGURES 15 

FIGS. 1A-B are photomicrographs illustrating typical 
early cytopathic effects seen with Coronavirus isolates and 
serum from SARS patients. FIG. 1Ais a photomicrograph of 
Vero E6 cells inoculated with an oropharyngeal specimen 20 
from a SARS patient (x40). FIG. 1B is a photomicrograph 
of infected Vero E6 cells reacting with the serum of a 
convalescent SARS patient in an indirect fluorescent anti¬ 
body (IFA) assay (x400). 

FIGS. 2A-B are electronmicrographs illustrating ultra- 25 
structural characteristics of the SARS-associated Coronavi¬ 
rus (SARS-CoV). FIG. 2A is a thin-section electron-micro- 
scopical view of viral nucleocapsids aligned along the 
membrane of the rouglt endoplasmic reticulum (arrow) as 
particles bud into the cisternae. Enveloped virions have 30 
surface projections (arrowhead) and an electron-lucent Cen¬ 
ter. Directly under the viral envelope lies a characteristic 
ring fonned by the helical nucleocapsid, often seen in 
cross-section. FIG. 2B is a negative stain (methylamine 
tungstate) electronmicrograph showing stain-penetrated , . 
Coronavirus particle with the typical internal helical nucleo- 
capsid-like structure and club-shaped surface projections 
surrounding the periphery of the particle. Bars: 100 nm. 

FIG. 3 is an estimated maximum parsimony tree illustrat¬ 
ing putative phylogenetic relationships between SARS-CoV 
and other human and animal coronaviruses. Phylogenetic 40 
relationships are based on sequence alignment of 405 nucle- 
otides of the Coronavirus polymerase gene ORFlb (nucleic 
acid 15,173 to 15,578 of SEQ ID NO: 1). The three major 
Coronavirus antigenic groups (I, II and III), represented by 
HcoV-229E, CCoV, FIPV, TGEV, PEDV, HcoV-OC43, 45 
BCoV, HEV, SDAV, MHV, TCoV, and IBV-Avian, are 
shown shaded. Bootstrap values (100 replicates) obtained 
from a 50% majority rale consensus tree are plotted at the 
main internal branches of the phylogram. Branch lengths are 
proportionate to nucleotide differences. 50 

FIG. 4 is a pictorial representation of neighbor joining 
trees illustrating putative phylogenetic relationships 
between SARS-CoV and other human and animal coronavi¬ 
ruses. Amino acid sequences of the indicated SARS-CoV 
proteins were compared with those from reference viruses 5J 
representing each species in the three groups of coronavi¬ 
ruses for which complete genomic sequence information 
was available [group 1: HCoV-229E (AF304460); PEDV 
(AF353511); TGEV (AJ271965); group 2: BCoV 
(AF220295); MHV (AF201929); group 3: infectious bron- 
cliitis virus (M95169)]. Sequences for representative strains 60 
of other Coronavirus species, for wliich partial sequence 
information was available, were included for some of the 
structural protein comparisons [group 1: CCoV (D13096); 
FCoV (AY204704); porcine respiratory Coronavirus 
(Z24675); group 2: HCoV-OC43 (M76373, L14643, 65 
M93390); HEV (AY078417); rat Coronavirus (AF207551)]. 
Sequence aligmnents and neighbor-joining trees were gen- 
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erated by using Clustalx 1.83 with the Gönnet protein 
comparison matrix. The resulting trees were adjusted for 
final output using treetool 2 . 0 . 1 . 

FIGS. 5A-C are photomicrographs illustrating diffuse 
alveolar damage in a patient with SARS (FIGS. 5A-B), and 
imm unohistochernical staining of SARS-CoV-infected Vero 
E6 cells (FIG. SC). FIG. 5A is a photomicrograph of lung 
tissue from a SARS patient (x50). Diffuse alveolar damage, 
abundant foamy macrophages and multinucleated syncytial 
cells are present; hematoxylin and eosin stain was used. FIG. 
SB is a higher magnification photomicrograph of lung tissue 
from the same SARS patient (x250). Syncytial cells show no 
conspicuous viral inclusions. FIG. SC is a photomicrograph 
of immunohistochemically stained SARS-CoV-infected 
cells (x250). Membranous and cytoplasmic inununostaining 
of individual and syncytial Vero E6 cells was achieved using 
feline anti-FIPV-1 ascitic fluid. Immtmoalkaline phos- 
phatase with naphthol-fast red Substrate and hematoxylin 
counter stain was used. 

FIG. 6A-B are electronmicrographs illustrating ultra- 
structural characteristics of a coronavirus-infected cell in 
bronchoalveolar lavage (BAL) from a SARS patient. FIG. 
6A is an electronmicrograph of a coronavirus-infected cell. 
Numerous intracellular and extracellular particles are 
present; virions are indicated by the arrowheads. FIG. 6B is 
a higher magnification electronmicrograph of the area seen 
at the arrow in FIG. 6A (rotated clockwise approximately 
90°). Bars: FIG. 6A, 1 gm; FIG. 6B, 100 nm. 

FIGS. 7A-C illustrate the Organization of the SARS-CoV 
genome. FIG. 7A is a diagram of the overall Organization of 
the 29,727-nt SARS-CoV genomic RNA. The 72-nt leader 
sequence is represented as a small rectangle at the left-most 
end. ORFsla and 1b, encoding the nonstructural polypro- 
teins, and those ORFs encoding the S, E, M, and N structural 
proteins, are indicated. Vertical position of the boxes indi- 
cates the phase of the reading frame (phase 1 for proteins 
above the line, phase two for proteins on the line and phase 
three for proteins below the line). FIG. 7B is an expanded 
view of the structural protein encoding region and predicted 
mRNA transcripts. Known structural protein encoding 
regions (dark grey boxes) and regions and reading frames 
for potential products X1-X5 (light gray boxes) are indi¬ 
cated. Lengths and map locations of the 3'-coterminal 
mRNAs expressed by the SARS-CoV are indicated, as 
predicted by identification of conserved transcriptional regu- 
latory sequences. FIG. 7C is a digitized image of a nylon 
membrane showing Northern blot analysis of SARS-CoV 
mRNAs. Poly(A)+ RNA from infected Vero E6 cells was 
separated on a formaldehyde-agarose gel, transferred to a 
nylon membrane, and hybridized with a digoxigenin-labeled 
riboprobe overlapping the 3'-untranslated region. Signals 
were visualized by chemiluminescence. Sizes of the SARS- 
CoV mRNAs were calculated by extrapolation from a log¬ 
linear fit of the molecular mass tnarker. Lane 1, SARS-CoV 
mRNA; lane 2, Vero E6 cell mRNA; lane 3, molecular mass 
rnarker, sizes in kB. 

SEQUENCE LISTING 

The nucleic and amino acid sequences listed in the 
accompanying sequence listing are shown using Standard 
letter abbreviations for nucleotide bases, and three letter 
code for amino acids, as defined in 37 C.F.R. 1.822. Only 
one Strand of each nucleic acid sequence is shown, but the 
complementary Strand is understood as included by any 
reference to the displayed Strand. In the accompanying 
sequence listing: 
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SEQ ID NO: 1 shows the nucleic acid sequence of the 
SARS-CoV genome. 

SEQ ID NO: 2 shows the amino acid sequence of the 
SARS-CoV polyprotein la (encoded by nucleic acid 265 to 
nucleic acid 13,398 of SEQ ID NO: 1). 

SEQ ID NO: 3 shows the amino acid sequence of the 
SARS-CoV polyprotein lb (encoded by nucleic acid 13,398 
to 21,482 of SEQ ID NO: 1). 

SEQ ID NO: 4 shows the amino acid sequence of the 
SARS-CoV S protein (encoded by nucleic acid 21,492 to 
25,256 of SEQ ID NO: 1). 

SEQ ID NO: 5 shows the amino acid sequence of the 
SARS-CoV XI protein (encoded by nucleic acid 25,268 to 
26,089 of SEQ ID NO: 1). 

SEQ ID NO: 6 shows the amino acid sequence of the 
SARS-CoV X2 protein (encoded by nucleic acid 25,689 to 
26,150 of SEQ ID NO: 1). 

SEQ ID NO: 7 shows the amino acid sequence of the 
SARS-CoV E protein (encoded by nucleic acid 26,117 to 
26,344 of SEQ ID NO: 1). 

SEQ ID NO: 8 shows the amino acid sequence of the 
SARS-CoV M protein (encoded by nucleic acid 26,398 to 
27,060 of SEQ ID NO: 1). 

SEQ ID NO: 9 shows the amino acid sequence of the 
SARS-CoV X3 protein (encoded by nucleic acid 27,074 to 
27,262 of SEQ ID NO: 1). 

SEQ ID NO: 10 shows the amino acid sequence of the 
SARS-CoV X4 protein (encoded by nucleic acid 27,273 to 
27,638 of SEQ ID NO: 1). 

SEQ ID NO: 11 shows the amino acid sequence of the 
SARS-CoV X5 protein (encoded by nucleic acid 27,864 to 
28,115 of SEQ ID NO: 1). 

SEQ ID NO: 12 shows the amino acid sequence of the 
SARS-CoV N protein (encoded by nucleic acid 28,120 to 
29,385 of SEQ ID NO: 1). 

SEQ ID NOs: 13-15 show the nucleic acid sequence of 
several SARS-CoV-specific oligonucleotide primers. 

SEQ ID NOs: 16-33 show the nucleic acid sequence of 
several oligonucleotide primers/probes used for real-time 
reverse transcription-polymerase chain reaction (RT-PCR) 
SARS-CoV assays. 

SEQ ID NOs: 34-35 show the nucleic acid sequence of 
two degenerate primers designed to anneal to sites encoding 
conserved Coronavirus amino acid motifs. 

SEQ ID NOs: 36-38 show the nucleic acid sequence of 
several oligonucleotide primers/probe used as Controls in 
real-time RT-PCR assays. 

DETAILED DESCRIPTION OF SEVERAL 
EMBODIMENTS 


I. Abbreviations 


M: 

Coronavirus membrane protein 

N: 

Coronavirus nucleoprotein 

ORF: 

open reading frame 

PCR 

polymerase chain reaction 

RACE: 

5' rapid amplification of cDNA ends 

RT-PCR: 

reverse transcription-polymerase chain reaction 

S: 

coronavims Spike protein 

SARS: 

severe acute respiratory syndrome 

SARS-CoV: 

severe acute respiratory syndrome-associated Coronavirus 

TRS: 

transcriptional regulatory sequence 


II. Terms 

Unless otherwise noted, technical tenns are used accord- 
ing to conventional usage. Definitions of common terms in 
molecular biology may be found in Benjamin Lewin, Genes 
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VII, published by Oxford University Press, 2000 (ISBN 
019879276X); Kendrew et al. (eds.), The Encyclopedia of 
Molecular Biologv , published by Blackwell Publishers, 

1994 (ISBN 0632021829); and Robert A. Meyers (ed.), 
5 Molecular Biology and Biotechnology: a Comprehensive 

Desk Reference, published by Wiley, John & Sons, Inc., 

1995 (ISBN 0471186341); and other similar references. 

As used herein, the singulär tenns “a,” “an,” and “the” 

include plural referents unless context clearly indicates 
io otherwise. Similarly, the word “or” is intended to include 
“and” unless the context clearly indicates otherwise. Also, as 
used herein, the tenn “comprises” means “includes.” Hence 
“comprising A or B” means including A, B, or A and B. It 
is further to be understood tliat all nucleotide sizes or amino 
15 acid sizes, and all molecular weight or molecular rnass 
values, given for nucleic acids or polypeptides are approxi- 
mate, and are provided for description. Although methods 
and materials similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, 
suitable methods and materials are described below. All 
20 publications, patent applications, patents, and other refer¬ 
ences mentioned herein are incorporated by reference in 
their entirety. In case of conflict, the present specification, 
including explanations of tenns, will control. In addition, the 
materials, methods, and examples are illustrative only and 
25 not intended to be limiting. 

In order to facilitate review of the various embodiments of 
this disclosure, the following explanations of specific terms 
are provided: 

Adjuvant: A substance that non-specifically enhances the 
30 immune response to an antigen. Development of vaccine 
adjuvants for use in humans is reviewed in Singh et al. ( Nat. 
Biotechnol. 17:1075-1081, 1999), which discloses that, at 
the time of its publication, aluminum salts and the MF59 
microemulsion are the only vaccine adjuvants approved for 
35 human use. 

Amplification: Amplification of a nucleic acid molecule 
(e.g., a DNA or RNA molecule) refers to use of a laboratory 
technique that increases the number of copies of a nucleic 
acid molecule in a sample. An example of amplification is 
40 the polymerase chain reaction (PCR), in which a sample is 
contacted with a pair of oligonucleotide primers under 
conditions that allow for the hybridization of the primers to 
a nucleic acid template in the sample. The primers are 
extended under suitable conditions, dissociated from the 
45 template, re-annealed, extended, and dissociated to amplify 
the number of copies of the nucleic acid. The product of 
amplification can be characterized by such techniques as 
electrophoresis, restriction endonuclease cleavage patterns, 
oligonucleotide hybridization or ligation, and/or nucleic acid 
_ n sequencing. 

Other examples of amplification methods include Strand 
displacement amplification, as disclosed in U.S. Pat. No. 
5,744,311; transcription-free isothermal amplification, as 
disclosed in U.S. Pat. No. 6,033,881; repair chain reaction 
amplification, as disclosed in WO 90/01069; ligase chain 
55 reaction amplification, as disclosed in EP-A-320,308; gap 
filling ligase chain reaction amplification, as disclosed in 
U.S. Pat. No. 5,427,930; and NASBA™ RNA transcription- 
free amplification, as disclosed in U.S. Pat. No. 6,025,134. 
An amplification method can be modified, including for 
60 example by additional Steps or coupling the amplification 
with another protocol. 

Animal: Living multi-cellular vertebrate oiganisms, a 
category that includes, for example, mammals and birds. 
The tenn mammal includes both human and non-human 
65 ma mm als. Similarly, the tenn “subject” includes both 
human and veterinary subjects, for example, humans, non¬ 
human primates, dogs, cats, horses, and cows. 
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Antibody: A protein (or protein complex) that includes 
one or more polypeptides substantially encoded by immu- 
noglobulin genes or fragments of immunoglobulin genes. 
The recognized immunoglobulin genes include the kappa, 
lambda, alpha, gamma, delta, epsilon, and mu constant 5 
region genes, as well as the myriad immunoglobulin vari¬ 
able region genes. Light chains are classified as either kappa 
or lambda. Heavy chains are classified as gamma, mu, alpha, 
delta, or epsilon, which in tum define the immunoglobulin 
classes, IgG, IgM, IgA, IgD and IgE, respectively. . 

The basic immunoglobulin (antibody) structural unit is 
generally a tetramer. Each tetramer is composed of two 
identical pairs of polypeptide chains, each pair having one 
“light” (about 25 kDa) and one “heavy” (about 50-70 kDa) 
chain. The N-terminus of each chain defines a variable 
region of about 100 to 110 or more amino acids primarily 15 
responsible for antigen recognition. The temis “variable 
light chain” (V £ ) and “variable heavy chain” (V^,) refer, 
respectively, to these light and heavy chains. 

As used herein, the term “antibodies” includes intact 
immunoglobulins as well as a number of well-characterized 20 
fragments. For instance, Fabs, Fvs, and single-chain Fvs 
(SCFvs) that bind to target protein (or epitope within a 
protein or fusion protein) would also be specific binding 
agents for that protein (or epitope). These antibody frag¬ 
ments are defined as follows: (1) Fab, the fragment which 25 
contains a monovalent antigen-binding fragment of an anti¬ 
body molecule produced by digestion of whole antibody 
with the enzyrne papain to yield an intact light chain and a 
portion of one heavy chain; (2) Fab', the fragment of an 
antibody molecule obtained by treating whole antibody with ,, ) 
pepsin, followed by reduction, to yield an intact light chain 
and a portion of the heavy chain; two Fab' fragments are 
obtained per antibody molecule; (3) (Fab 1 ),, the fragment of 
the antibody obtained by treating whole antibody with the 
enzyme pepsin without subsequent reduction; (4) F(ab') 2 , a 
dimer of two Fab' fragments held together by two disulfide 35 
bonds; (5) Fv, a genetically engineered fragment containing 
the variable region of the light chain and the variable region 
of the heavy chain expressed as two chains; and (6) single 
chain antibody, a genetically engineered molecule contain¬ 
ing the variable region of the light chain, the variable region 40 
of the heavy chain, linked by a suitable polypeptide linker as 
a genetically fused single chain molecule. Methods of mak- 
ing these fragments are routine (see, for example, Harlow 
and Lane, Using Antibodies: A Laboratorv Manual, CSHL, 
New York, 1999). ' 45 

Antibodies for use in the methods and devices of this 
disclosure can be monoclonal or polyclonal. Merely by way 
of example, monoclonal antibodies can be prepared from 
murine hybridomas according to the classical method of 
Köhler and Milstein ( Nature 256:495-97, 1975) or deriva- 50 
tive methods thereof. Detailed procedures for monoclonal 
antibody production are described in Harlow and Lane, 
Using Antibodies: A Laboratorv Manual, CSHL, New York, 
1999. 

Antigen: A compound, composition, or substance that can 
stimulate the production of antibodies or a T-cell response in 
an animal, including compositions that are injected or 
absorbed into an animal. An antigen reacts with the products 
of specific humoral or cellular immunity, including those 
induced by heterologous immunogens. In one embodiment, 
an antigen is a Coronavirus antigen. 60 

Binding or Stable Binding: An oligonucleotide binds or 
stably binds to a target nucleic acid if a sufficient amount of 
the oligonucleotide fonns base pairs or is hybridized to its 
target nucleic acid, to permit detection of that binding. 
Binding can be detected by either physical or functional 65 
properties of the target: oligonucleotide complex. Binding 
between a target and an oligonucleotide can be detected by 
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any procedure known to one skilled in the art, including 
functional or physical binding assays. Binding can be 
detected functionally by detennining whether binding has an 
observable efifect upon a biosynthetic process such as 
expression of a gene, DNA replication, transcription, trans- 
lation, and the like. 

Physical methods of detecting the binding of complemen- 
tary Strands of DNA or RNA are well known in the art, and 
include such methods as DNase I or Chemical footprinting, 
gel shift and affinity cleavage assays, Northern blotting, 
Southern blotting, dot blotting, and light absorption detec¬ 
tion procedures. For example, a method which is widely 
used, because it is so simple and reliable, involves observing 
a change in light absorption of a solution containing an 
oligonucleotide (or an analog) and a target nucleic acid at 
220 to 300 nm as the temperature is slowly increased. If the 
oligonucleotide or analog has bound to its target, there is a 
sudden increase in absorption at a characteristic temperature 
as the oligonucleotide (or analog) and target dissociate or 
melt. 

The binding between an oligomer and its target nucleic 
acid is frequently characterized by the temperature (T m ) at 
which 50% of the oligomer is melted from its target. A 
higher T m means a stronger or more stable complex relative 
to a complex with a lower T„. 

cDNA (complementary DNA): A piece of DNA lacking 
internal, non-coding Segments (introns) and regulatory 
sequences that determine transcription. cDNAis synthesized 
in the laboratory by reverse transcription from messenger 
RNA extracted from cells. 

Electrophoresis: Electrophoresis refers to the migration of 
charged solutes or particles in a liquid medium linder the 
influence of an electric field. Electrophoretic separations are 
wddely used for analysis of macromolecules. Of particular 
importance is the identification of proteins and nucleic acid 
sequences. Such separations can be based on differences in 
size and/or Charge. Nucleotide sequences have a uniform 
Charge and are therefore separated based on differences in 
size. Electrophoresis can be performed in an unsupported 
liquid medium (for example, capillary electrophoresis), but 
more commonly the liquid medium travels through a solid 
supporting medium. The most widely used supporting media 
are gels, for example, polyacrylamide and agarose gels. 

Sieving gels (for example, agarose) impede the flow of 
molecules. The pore size of the gel determines the size of a 
molecule that can flow freely through the gel. The amount of 
time to travel through the gel increases as the size of the 
molecule increases. As a result, small molecules travel 
through the gel more quickly than large molecules and thus 
progress further from the sample application area than laiger 
molecules, in a given time period. Such gels are used for 
size-based separations of nucleotide sequences. 

Fragments of linear DNA migrate through agarose gels 
with a mobility that is inversely proportional to the log 10 of 
their molecular weight. By using gels with different con- 
centrations of agarose, different sizes of DNA fragments can 
be resolved. Higher concentrations of agarose facilitate 
Separation of small DNAs, while low agarose concentrations 
allow resolution of larger DNAs. 

Hybridization: Oligonucleotides and their analogs hybrid- 
ize by hydrogen bonding, which includes Watson-Crick, 
Hoogsteen or reversed Hoogsteen hydrogen bonding, 
between complementary bases. Generally, nucleic acid con- 
sists of nitrogenous bases that are either pyrimidines (cy- 
tosine (C), uracil (U), and thymine (T)) or purines (adenine 
(A) and guanine (G)). These nitrogenous bases form hydro¬ 
gen bonds between a pyrimidine and a purine, and the 
bonding of the pyrimidine to the purine is referred to as 
“base pairing.” More specifically, A will hydrogen bond to 
T or U, and G will bond to C. “Complementary” refers to the 
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base pairing that occurs between to distinct nucleic acid 
sequences or two distinct regions of the same nucleic acid 
sequence. 

“Specifically hybridizable” and “specifically complemen- 
tary” are terms that indicate a sufficient degree of comple- 5 
mentarity such that stable and specific binding occurs 
between the oligonucleotide (or its analog) and the DNA or 
RNA target. The oligonucleotide or oligonucleotide analog 
need not be 100 % complementary to its target sequence to 
be specifically hybridizable. An oligonucleotide or analog is 
specifically hybridizable when binding of the oligonucle¬ 
otide or analog to the target DNA or RNA molecule inter- 
feres with the normal fimction of the target DNA or RNA, 
and there is a sufficient degree of complementarity to avoid 
non-specific binding of the oligonucleotide or analog to 
non-target sequences under conditions where specific bind- 15 
ing is desired, for example under physiological conditions in 
the case of in vivo assays or Systems. Such binding is 
referred to as specific hybridization. 

Hybridization conditions resulting in particular degrees of 
stringency will vary depending upon the nature of the 20 
hybridization method of choice and the composition and 
length of the hybridizing nucleic acid sequences. Generally, 
the temperature of hybridization and the ionic strength 
(especially the Na + and/or Mg** concentration) of the 
hybridization buffer will determine the stringency of hybrid- 25 
ization, though wash times also influence stringency. Cal- 
culations regarding hybridization conditions required for 
attaining particular degrees of stringency are discussed by 
Sambrook et al. (ed.), Molecular Cloning: A Laboratory 
Manual, 2” d ed., vol. 1-3, Cold Spring Harbor Laboratory , () 
Press, Cold Spring Harbor, N.Y., 1989, chapters 9 and 11; 
and Ausubel et al. Short Protocols in Molecular Biology, 4 ,h 
ed., John Wiley & Sons, Inc., 1999. 

For purposes of the present disclosure, “stringent condi¬ 
tions” encompass conditions under which hybridization will 
only occur if there is less than 25% mismatch between the 35 
hybridization molecule and the target sequence. “Stringent 
conditions” may be broken down into particular levels of 
stringency for more precise definition. Thus, as used herein, 
“moderate stringency” conditions are those under which 
molecules with more than 25% sequence mismatch will not 40 
hybridize; conditions of “medium stringency” are those 
under which molecules with more than 15% mismatch will 
not hybridize, and conditions of “high stringency” are those 
under which sequences with more than 10 % mismatch will 
not hybridize. Conditions of “very high stringency” are 45 
those under which sequences with more than 6 % mismatch 
will not hybridize. 

Immune Stimulatory Composition: A term used herein to 
mean a composition usefi.il for stimulating or eliciting a 
specific immune response (or immimogenic response) in a 50 
vertebrate. In some embodiments, the immunogenic 
response is protective or provides protective immunity, in 
that it enables the vertebrate animal to better resist infection 
with or disease progression from the organism against which 
the vaccine is directed. 

Without wishing to be boimd by a specific theory, it is 
believed that an immunogenic response may arise from the 
generation of an antibody specific to one or more of the 
epitopes provided in the immune stimulatory composition. 
Altematively, the response may comprise a T-helper or 
cytotoxic cell-based response to one or more of the epitopes 60 
provided in the immune stimulatory composition. All three 
of these responses may originate from naive or memory 
cells. One specific example of a type of immune stimulatory 
composition is a vaccine. 

In some embodiments, an “effective amoimt” or 65 
“immune-stimulatory amount” of an immune stimulatory 
composition is an amoimt which, when administered to a 
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subject, is sufficient to engender a detectable immune 
response. Such a response may comprise, for instance, 
generation of an antibody specific to one or more of the 
epitopes provided in the immune stimulatory composition. 
Altematively, the response may comprise a T-helper or 
CTL-based response to one or more of the epitopes provided 
in the immune stimulatory composition. All three of these 
responses may originate from naive or memory cells. In 
other embodiments, a “protective effective amount” of an 
immune stimulatory composition is an amount which, when 
administered to a subject, is sufficient to confer protective 
immunity upon the subject. 

Inhibiting or Treating a Disease: Inhibiting the full devel¬ 
opment of a disease or condition, for example, in a subject 
who is at risk for a disease such as SARS. “Treatment” refers 
to a therapeutic intervention that ameliorates a sign or 
Symptom of a disease or pathological condition after it has 
begun to develop. As used herein, the term “ameliorating,” 
with reference to a disease, pathological condition or Symp¬ 
tom, refers to any observable beneficial effect of the treat- 
ment. The beneficial effect can be evidenced, for example, 
by a delayed onset of clinical Symptoms of the disease in a 
susceptible subject, a reduction in severity of some or all 
clinical Symptoms of the disease, a slower progression of the 
disease, a reduction in the number of relapses of the disease, 
an improvement in the overall health or well-being of the 
subject, or by other parameters well known in the art that are 
specific to the particular disease. 

Isolated: An “isolated” microorganism (such as a virus, 
bacterium, fungus, or protozoan) has been substantially 
separated or purified away from microorganisms of different 
types, strains, or species. Microorganisms can be isolated by 
a variety of techniques, including serial dilution and cultur- 
ing. 

An “isolated” biological component (such as a nucleic 
acid molecule, protein or organeile) has been substantially 
separated or purified away from other biological compo- 
nents in the cell of the organism in which the component 
naturally occurs, such as other chromosomal and extra¬ 
chromosomal DNA and RNA, proteins, and organeiles. 
Nucleic acids and proteins that have been “isolated” include 
nucleic acids and proteins purified by Standard purification 
methods. The term also embraces nucleic acids and proteins 
prepared by recombinant expression in a host cell, as well as 
chemically synthesized nucleic acids or proteins, or frag- 
ments thereof. 

Label: A detectable compound or composition that is 
conjugated directly or indirectly to another molecule to 
facilitate detection of that molecule. Specific, non-limiting 
examples of labels include fluorescent tags, enzymatic link- 
ages, and radioactive isotopes. 

Nucleic Acid Molecule: Apolymeric form of nucleotides, 
which may include both sense and anti-sense Strands of 
RNA, cDNA, genomic DNA, and synthetic forms and 
mixed polymers of the above. A nucleotide refers to a 
ribonucleotide, deoxynucleotide or a modified form of either 
type of nucleotide. A “nucleic acid molecule” as used herein 
is synonymous with “nucleic acid” and “polynucleotide.” A 
nucleic acid molecule is usually at least 10 bases in length, 
unless otherwise specified. The term includes single- and 
double-stranded forms of DNA. A polynucleotide may 
include either or both naturally occurring and modified 
nucleotides linked together by naturally occurring and/or 
non-naturally occurring nucleotide linkages. 

Oligonucleotide: A nucleic acid molecule generally com- 
prising a length of 300 bases or fewer. The term often refers 
to single-stranded deoxyribonucleotides, but it can refer as 
well to single- or double-stranded ribonucleotides, RNA: 
DNA hybrids and double-stranded DNAs, among others. 
The term “oligonucleotide” also includes oligonucleosides 
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(that is, an oligonucleotide minus the phosphate) and any 
other organic base polymer. In some examples, oligonucle- 
otides are about 10 to about 90 bases in length, for example, 
12, 13, 14, 15, 16, 17, 18, 19 or 20 bases in length. Other 
oligonucleotides are about 25, about 30, about 35, about 40, 
about 45, about 50, about 55, about 60 bases, about 65 bases, 
about 70 bases, about 75 bases or about 80 bases in length. 
Oligonucleotides may be single-stranded, for example, for 
use as probes or primers, or may be double-stranded, for 
example, for use in the construction of a mutant gene. 
Oligonucleotides can be either sense or anti-sense oligo¬ 
nucleotides. An oligonucleotide can be modified as dis- 
cussed above in reference to nucleic acid molecules. Oligo¬ 
nucleotides can be obtained from existing nucleic acid 
sources (for example, genomic or cDNA), but can also be 
synthetic (for example, produced by laboratory or in vitro 
oligonucleotide synthesis). 

Open Reading Frame (ORF): A series of nucleotide 
triplets (codons) coding for amino acids without any internal 
termination codons. These sequences are usually translat- 
able into a peptide/polypeptide/protein/polyprotein. 

Operably Linked: A first nucleic acid sequence is operably 
linked with a second nucleic acid sequence when the first 
nucleic acid sequence is placed in a functional relationship 
with the second nucleic acid sequence. For instance, a 
Promoter is operably linked to a coding sequence is the 
Promoter affects the transcription or expression of the cod¬ 
ing sequence. Generally, operably linked DNA sequences 
are contiguous and, where necessary to join two protein 
coding regions, in the same reading frame. If introns are 
present, the operably linked DNA sequences may not be 
contiguous. 

Pharmaceutically Acceptable Carriers: The pharmaceuti- 
cally acceptable carriers useful in this disclosure are con- 
ventional. Remington ’s Pharmaceutical Sciences, by E. W. 
Martin, Mack Publishing Co., Easton, Pa., 15th Edition 
(1975), describes compositions and formulations suitable for 
pharmaceutical delivery of one or ntore therapeutic Com¬ 
pounds or molecules, such as one or more SARS-CoV 
nucleic acid molecules, proteins or antibodies that bind these 
proteins, and additional pharmaceutical agents. 

In general, the nature of the carrier will depend on the 
particular mode of administration being employed. For 
instance, parenteral formulations usually comprise inject- 
able fluids that include pharmaceutically and physiologi- 
cally acceptable fluids such as water, physiological saline, 
balanced salt Solutions, aqueous dextrose, glycerol or the 
like as a vehicle. For solid compositions (for example, 
powder, pill, tablet, or capsule forms), conventional non- 
toxic solid carriers can include, for example, pharmaceutical 
grades of mannitol, lactose, starch, or magnesium stearate. 
In addition to biologically-neutral carriers, pharmaceutical 
compositions to be administered can contain minor amounts 
of non-toxic auxiliary substances, such as wetting or emul- 
sifying agents, preservatives, and pH buffering agents and 
the like, for example sodimn acetate or sorbitan monolau¬ 
rate. 

Polypeptide: A polymer in which the monomers are amino 
acid residues which are joined together through amide 
bonds. When the amino acids are alpha-amino acids, either 
the L-optical isomer or the D-optical isomer can be used, the 
L-isomers being preferred. The terms “polypeptide” or “pro- 
tein” as used herein are intended to encompass any amino 
acid sequence and include modified sequences such as 
glycoproteins. The term “polypeptide” is specifically 
intended to cover naturally occurring proteins, as well as 
those which are recombinantly or synthetically produced. 

Conservative amino acid substitutions are those substitu- 
tions that, when made, least interfere with the properties of 
the original protein, that is, the stracture and especially the 
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function of the protein is conserved and not significantly 
changed by such substitutions. Examples of conservative 
substitutions are shown below. 

5 


Original Residue 

Conservative Substitutions 

Ala 

Ser 

Arg 

Lys 

Asn 

Gin, His 

Asp 

Glu 

Cys 

Ser 

Gin 

Asn 

Glu 

Asp 

His 

Asn; Gin 

Ile 

Leu, Val 

Leu 

Ile; Val 

Lys 

Arg; Gin; Glu 

Met 

Leu; Ile 

Phe 

Met; Leu; Tyr 

Ser 

Thr 

Thr 

Ser 

Trp 

Tyr 

Tyr 

Trp; Phe 

Val 

Ile; Leu 


Conservative substitutions generally maintain (a) the 
25 structure of the polypeptide backbone in the area of the 
Substitution, for example, as a sheet or helical conformation, 
(b) the Charge or hydrophobicity of the molecule at the target 
site, or (c) the bulk of the side chain. 

The substitutions which in general are expected to pro- 
30 duce the greatest changes in protein properties will be 
non-conservative, for instance changes in which (a) a hydro- 
philic residue, for example, seryl or threonyl, is substituted 
for (or by) a hydrophobic residue, for example, leucyl, 
isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or 
proline is substituted for (or by) any other residue; (c) a 
J ' residue having an electropositive side chain, for example, 
lysyl, arginyl, or histadyl, is substituted for (or by) an 
electronegative residue, for example, glutamyl or aspartyl; 
or (d) a residue having a bulky side chain, for example, 
phenylalanine, is substituted for (or by) one not having a 
40 side chain, for example, glycine. 

Probes and Primers: A probe comprises an isolated 
nucleic acid attached to a detectable label or other reporter 
molecule. Typical labels include radioactive isotopes, 
enzyme Substrates, co-factors, ligands, chemiluminescent or 
45 fluorescent agents, haptens, and enzymes. Methods for 
labeling and guidance in the choice of labels appropriate for 
various purposes are discussed, for example, in Sambrook et 
al. (ed.), Molecular Cloning: A Laboratory Manual, 2" d ed., 
vol. 1-3, Cold Spring Harbor Laboratory Press, Cold Spring 
50 Harbor, N.Y., 1989 and Ausubel et al. Short Protocols in 
Molecular Biology, A‘ h ed., John Wiley & Sons, Inc., 1999. 

Primers are short nucleic acid molecules, for instance 
DNA oligonucleotides 10 nucleotides or more in length, for 
example that hybridize to contiguous complementary nucle- 
55 otides or a sequence to be amplified. Longer DNA oligo¬ 
nucleotides may be about 15, 20, 25, 30 or 50 nucleotides or 
more in length. Primers can be amiealed to a complementary 
target DNA Strand by nucleic acid hybridization to form a 
hybrid between the primer and the target DNA Strand, and 
then the primer extended along the target DNA Strand by a 
60 DNA polymerase enzyme. Primer pairs can be used for 
amplification of a nucleic acid sequence, for example, by the 
PCR or other nucleic-acid amplification methods known in 
the art, as described above. 

Methods for preparing and using nucleic acid probes and 
65 primers are described, for example, in Sambrook et al. (ed.), 
Molecular Cloning: A Laboratory> Manual, 2" d ed., vol. 1-3, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
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N.Y., 1989; Ausubel et al. Short Protocols in Molecular 
Biology, 4 th ed., John Wiley & Sons, Inc., 1999; and Innis et 
al. PCR Protocols, A Guide to Methods and Applications, 
Academic Press, Inc., San Diego, Calif., 1990. Amplifica- 
tion primer pairs can be derived from a known sequence, for 
example, by using Computer programs intended for that 
purpose such as Primer (Version 0.5, © 1991, Wliitehead 
Institute for Biomedical Research, Cambridge, Mass.). One 
of ordinary skill in the art will appreciate that the specificity 
of a particular probe or primer increases with its length. 
Thus, in Order to obtain greater specificity, probes and 
primers can be selected that comprise at least 20, 25, 30, 35, 
40, 45, 50 or more consecutive nucleotides of a target 
nucleotide sequences. 

Protein: Abiological molecule, particularly a polypeptide, 
expressed by a gene and comprised of amino acids. A 
“polyprotein” is a protein that, after synthesis, is cleaved to 
produce several functionally distinct polypeptides. 

Purified: The term “purified” does not require absolute 
purity; rather, it is intended as a relative term. Thus, for 
example, a purified protein preparation is one in which the 
subject protein is more pure than in its natural environment 
within a cell. Generally, a protein preparation is purified 
such that the protein represents at least 50% of the total 
protein content of the preparation. 

Recombinant Nucleic Acid: A sequence that is not natu- 
rally occurring or has a sequence that is made by an artificial 
combination of two otherwise separated Segments of 
sequence. This artificial combination is often accomplished 
by Chemical synthesis or, more commonly, by the artificial 
manipulation of isolated Segments of nucleic acids, for 
example, by genetic engineering techniques such as those 
described in Sambrook et al. (ed.), Molecular Cloning: A 
Laboratory Manual, 2” d ed., vol. 1-3, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989. The term 
recombinant includes nucleic acids that have been altered 
solely by addition, Substitution, or deletion of a portion of 
the nucleic acid. 

Sample: A portion, piece, or segment that is representative 
of a whole. This term encompasses any material, including 
for instance samples obtained from an animal, a plant, or the 
environment. 

An “environmental sample” includes a sample obtained 
from inanimate objects or reservoirs within an indoor or 
outdoor enviromnent. Environmental samples include, but 
are not limited to: soil, water, dust, and air samples; bulk 
samples, including building materials, fumiture, and landfill 
contents; and other reservoir samples, such as animal refiise, 
harvested grains, and foodstufFs. 

A “biological sample” is a sample obtained from a plant 
or animal subject. As used herein, biological samples 
include all samples useful for detection of viral infection in 
subjects, including, but not limited to: cells, tissues, and 
bodily fluids, such as blood; derivatives and fractions of 
blood (such as serum); extracted galls; biopsied or surgically 
removed tissue, including tissues that are, for example, 
unfixed, frozen, fixed in formalin and/or embedded in par- 
affin; tears; milk; skin scrapes; surface washings; urine; 
sputum; cerebrospinal fluid; prostate fluid; pus; bone mar- 
row aspirates; BAL; saliva; cervical swabs; vaginal swabs; 
and oropharyngeal wash. 

Sequence Identity: The similarity between two nucleic 
acid sequences, or two amino acid sequences, is expressed 
in terms of the similarity between the sequences, otherwise 
referred to as sequence identity. Sequence identity is fre- 
quently measured in terms of percentage identity (or simi¬ 
larity or homology); the higher the percentage, the more 
similar the two sequences are. 

Methods of aligmnent of sequences for comparison are 
well known in the art. Various programs and alignment 
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algorithms are described in: Smith and Watennan {Adv. 
Appl. Math., 2:482, 1981); Needleman and Wunsch (J. Mol. 
Biol., 48:443, 1970); Pearson and Lipman ( Proc. Natl. Acad. 
Sei., 85:2444, 1988); Higgins and Sharp (Gene, 73:237-44, 
5 1988); Higgins and Sharp (CABIOS, 5:151-53, 1989); Cor- 
pet et al. ( Nuc. Acids Res., 16:10881-90, 1988); Huang et al. 
(Comp. Appls Biosei., 8:155-65, 1992); and Pearson et al. 
(Meth. Mol. Biol., 24:307-31, 1994). Altschul et al. (Nature 
Genet., 6:119-29, 1994) presents a detailed consideration of 
„ sequence alignment methods and homology calculations. 

111 The aligmnent tools ALIGN (Myers and Miller, CABIOS 
4:11-17, 1989) or LFASTA (Pearson and Lipman, 1988) 
may be used to perform sequence comparisons (Internet 
Program© 1996, W. R. Pearson and the University of 
Virginia, “fasta20u63” Version 2.0u63, release date Decem- 
15 ber 1996). ALIGN compares entire sequences against one 
another, wliile LFASTA compares regions of local similarity. 
These aligmnent tools and their respective tutorials are 
available on the Internet at the NCSA website. Alternatively, 
for comparisons of amino acid sequences of greater than 
20 about 30 amino acids, the “Blast 2 sequences” function can 
be employed using the default BLOSUM62 matrix set to 
default parameters, (gap existence cost of 11 , and a per 
residue gap cost of 1). When aligning short peptides (fewer 
than around 30 amino acids), the aligmnent should be 
25 perfonned using the “Blast 2 sequences” function, employ- 
ing the PAM30 matrix set to default parameters (open gap 9, 
extension gap 1 penalties). The BLAST sequence compari¬ 
son system is available, for instance, from the NCBI web 
site; see also Altschul et al.,/. Mol. Biol., 215:403-10,1990; 
Gish. and States, Nature Genet., 3:266-72, 1993; Madden et 
al., Meth. Enzymol., 266:131-41, 1996; Altschul et al., 
Nucleic Acids Res., 25:3389-402, 1997; and Zhang and 
Madden, Genome Res., 7:649-56, 1997. 

Orthologs (equivalent to proteins of other species) of 
proteins are in some instances characterized by possession 
35 of greater than 75% sequence identity counted over the 
full-length alignment with the amino acid sequence of 
specific protein using ALIGN set to default parameters. 
Proteins with even greater similarity to a reference sequence 
will show increasing percentage identities when assessed by 
40 this method, such as at least 80%, at least 85%, at least 90%, 
at least 92%, at least 95%, or at least 98% sequence identity. 
In addition, sequence identity can be compared over the full 
length of one or both binding domains of the disclosed 
fiision proteins. 

45 When significantly less than the entire sequence is being 
compared for sequence identity, homologous sequences will 
typically possess at least 80% sequence identity over short 
Windows of 10-20, and may possess sequence identities of 
at least 85%, at least 90%, at least 95%, or at least 99% 
50 depending on their similarity to the reference sequence. 
Sequence identity over such short Windows can be deter- 
mined using LFASTA; methods are described at the NCSA 
website. One of skill in the art will appreciate that these 
sequence identity ranges are provided for guidance only; it 
is entirely possible that strongly significant homologs could 
55 be obtained that fall outside of the ranges provided. Similar 
homology concepts apply for nucleic acids as are described 
for protein. 

An alternative indication that two nucleic acid molecules 
are closely related is that the two molecules hybridize to 
60 each other linder stringent conditions. Representative 
hybridization conditions are discussed above. 

Nucleic acid sequences that do not show a high degree of 
identity may nevertheless encode similar amino acid 
sequences, due to the degeneracy of the genetic code. It is 
65 understood that changes in nucleic acid sequence can be 
made using this degeneracy to produce multiple nucleic acid 
sequences that each encode substantially the same protein. 



US 7,220,852 Bl 


15 

Specific Binding Agent: An agent that binds substantially 
only to a defined taiget. Thus a protein-specific binding 
agent binds substantially only the defined protein, or to a 
specific region within the protein. As used herein, a protein- 
specific binding agent includes antibodies and other agents 5 
that bind substantially to a specified polypeptide. The anti¬ 
bodies may be monoclonal or polyclonal antibodies that are 
specific for the polypeptide, as well as iminunologically 
effective portions (“fragments”) thereof. 

The determination that a particular agent binds substan- lfJ 
tially only to a specific polypeptide may readily be made by 
using or adapting routine procedures. One suitable in vitro 
assay makes use of tlie Western blotting procedure (de- 
scribed in many Standard texts, including Harlow and Lane, 
Using Antibodies: A Laboratory Manual, CSHL, New York, 
1999). 15 

Transformed: A “transformed” cell is a cell into which has 
been introduced a nucleic acid molecule by molecular 
biology techniques. The term encompasses all techniques by 
which a nucleic acid molecule might be introduced into such 
a cell, including transfection with viral vectors, transforma- 20 
tion with plasmid vectors, and introduction of naked DNA 
by electroporation, lipofection, and particle gun accelera- 
tion. 

Vector: A nucleic acid molecule as introduced into a host 
cell, thereby producing a transformed host cell. A vector 25 
may include nucleic acid sequences that permit it to replicate 
in a host cell, such as an origin of replication. A vector may 
also include one or more selectable marker genes and other 
genetic elements known in the art. 

Virus: Microscopic infectious organism that reproduces 30 
inside living cells. A virus typically consists essentially of a 
core of a single nucleic acid surrounded by a protein coat, 
and has the ability to replicate only inside a living cell. 
“Viral replication” is the production of additional virus by 
the occurrence of at least one viral life cycle. A virus may 35 
subvert the host cells’ normal functions, causing the cell to 
behave in a manner determined by the virus. For example, 
a viral infection may result in a cell producing a cytokine, or 
responding to a cytokine, when the uninfected cell does not 
normally do so. 

“Coronaviruses” are large, enveloped, RNA vimses that 411 
cause respiratory and enteric diseases in humans and other 
animals. Coronavirus genomes are non-segmented, single- 
stranded, positive-sense RNA, approximately 27-31 kb in 
length. Genomes have a 5' methylated cap and 3' poly-Atail, 
and function directly as mRNA. Host cell entry occurs via 45 
endocytosis and membrane fiision, and replication occurs in 
the cytoplasm. Initially, the 5' 20 kb of the positive-sense 
genome is translated to produce a viral polymerase, which 
then produces a full-length negative-sense Strand used as a 
template to produce subgenomic mRNA as a “nested set” of 50 
transcripts. Assembly occurs by budding into the golgi 
apparatus, and particles are transported to the surface of the 
cell and released. 

III. OverView of Several Embodiments 

A newly isolated human Coronavirus (SARS-CoV) is 
disclosed herein. The entire genomic nucleic acid sequence 
of this virus is also provided herein. Also disclosed are the 
nucleic acid sequences of the SARS-CoV ORFs, and the 
polypeptide sequences encoded by these ORFs. Pharmaceu- 
tical and i mmu ne stimulatory compositions are also dis- 60 
closed that include one or more SARS-CoV viral nucleic 
acids, polypeptides encoded by these viral nucleic acids and 
antibodies that bind to a SARS-CoV polypeptide or SARS- 
CoV polypeptide fragment. 

In one embodiment, a method is provided for detecting 65 
the presence of SARS-CoV in a sample. Hiis method 
includes contacting the sample with a pair of nucleic acid 
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primers that hybridize to a SARS-CoV nucleic acid, wherein 
at least one primer is 5'-end labeled with a reporter dye, 
amplifying the SARS-CoV nucleic acid or a fragment 
thereof from the sample utilizing the pair of nucleic acid 
primers, electrophoresing the amplified products, and 
detecting the 5'-end labeled reporter dye, thereby detecting 
a SARS-CoV. In a specific, non-limiting example, the ampli- 
fication utilizes RT-PCR. In a further specific example of the 
provided method, at least one of the nucleic acid primers that 
hybridize to a SARS-CoV nucleic acid includes a sequence 
as set forth in any one of SEQ ID NOs: 13-15. 

In another example of the provided method, detecting the 
presence of SARS-CoV in a sample includes contacting the 
sample with a pair of nucleic acid primers that hybridize to 
a SARS-CoV nucleic acid, amplifying the SARS-CoV 
nucleic acid or a fragment thereof from the sample utilizing 
the pair of nucleic acid primers, adding to the amplified 
SARS-CoV nucleic acid or the fragment thereof a TaqMan 
SARS-CoV probe that hybridizes to the SARS-CoV nucleic 
acid, wherein the TaqMan SARS-CoV probe is labeled with 
a 5'-reporter dye and a 3'-quencher dye, performing one or 
more additional rounds of amplification, and detecting fluo¬ 
rescence of the 5'-reporter dye, thereby detecting a SARS- 
CoV. In a specific, non-limiting example, the amplification 
utilizes RT-PCR. In a further specific example of the pro¬ 
vided method, at least one of the nucleic acid primers that 
hybridize to a SARS-CoV nucleic acid and/or the TaqMan 
SARS-CoV probe that hybridizes to the SARS-CoV nucleic 
acid includes a sequence as set forth in any one of SEQ ID 
NOs: 16-33. 

In another embodiment, a method is provided for detect¬ 
ing a SARS-CoV in a biological sample that contains 
antibodies. This method includes contacting the biological 
sample with a SARS-CoV-specific antigen, wherein the 
antigen includes a SARS-CoV organism and determining 
whether a binding reaction occurs between the SARS-CoV- 
specific antigen and an antibody in the biological sample if 
such is present, thereby detecting SARS-CoV. 

In a further embodiment, a method is provided for detect¬ 
ing a SARS-CoV in a biological sample that contains 
polypeptides and/or fragments thereof. This method 
includes contacting the biological sample with a SARS- 
CoV-specific antibody and determining whether a binding 
reaction occurs between the SARS-CoV-specific antibody 
and a SARS-CoV polypeptide or fragment thereof in the 
biological sample if such is present, thereby detecting 
SARS-CoV. In a specific, non-limiting example, determin¬ 
ing whether a binding reaction occurs between the SARS- 
CoV-specific antibody and a SARS-CoV polypeptide or 
fragment thereof is carried out in situ or in a tissue sample. 
In a further specific example, determining whether a binding 
reaction occurs between the SARS-CoV-specific antibody 
and a SARS-CoV polypeptide or fragment thereof includes 
an immunohistochemical assay. 

An additional embodiment includes a kit for detecting a 
SARS-CoV in a sample, including a pair of nucleic acid 
primers that hybridize linder stringent conditions to a SARS- 
CoV nucleic acid, wherein one primer is 5'-end labeled with 
a reporter dye. In a specific, non-limiting example, at least 
one of the nucleic acid primers that hybridize to a SARS- 
CoV nucleic acid includes a sequence as set forth in any one 
of SEQ ID NOs: 13-15. 

Another example of the provided kit includes a pair of 
nucleic acid primers that hybridize linder high stringency 
conditions to a SARS-CoV nucleic acid and a TaqMan 
SARS-CoV probe that hybridizes to the SARS-CoV nucleic 
acid, wherein the TaqMan SARS-CoV probe is labeled with 
a 5'-reporter dye and a 3'-quencher dye. In a specific, 
non-limiting example, at least one of the nucleic acid 
primers that hybridize to a SARS-CoV nucleic acid and/or 



US 7,220,852 Bl 


17 

the TaqMan SARS-CoV probe that hybridizes to the SARS- 
CoV nucleic acid includes a sequence as set forth in any one 
ofSEQ ID NOs: 16-33. 

Also disclosed herein is a composition including an 
isolated SARS-CoV organism. In one embodiment, the 
isolated SARS-CoV organism is an inactive isolated SARS- 
CoV organism. In another embodiment, the composition 
includes at least one component selected front the group 
consisting of phannaceutically acceptable carriers, adju- 
vants and combinations of two or more thereof. In yet 
another embodiment, the composition is introduced into a 
subject, thereby eliciting an immune response against a 
SARS-CoV antigenic epitope in a subject. 

IV. SARS-CoV Nucleotide and Amino Acid Sequences 

The current disclosure provides an isolated SARS-CoV 
genome, isolated SARS-CoV polypeptides, and isolated 
nucleic acid molecules encoding the same. In one embodi¬ 
ment, the isolated SARS-CoV genome has a sequence as 
shown in SEQ ID NO: 1 or an equivalent thereof. Poly- 
nucleotides encoding a SARS-CoV polypeptide (encoded by 
an ORF from within the genome) are also provided, and are 
termed SARS-CoV nucleic acid molecules. These nucleic 
acid molecules include DNA, cDNA and RNA sequences 
which encode a SARS-CoV polypeptide. Specific, non- 
limiting examples of a SARS-CoV nucleic acid molecule 
encoding an ORF are nucleic acid 265 to nucleic acid 13,398 
ofSEQ ID NO: 1 (encoding SARS-CoV la, SEQ ID NO: 2), 
nucleic acid 13,398 to 21,482 of SEQ ID NO: 1 (encoding 
SARS-CoV lb, SEQ ID NO: 3), nucleic acid 21,492 to 
25,256 ofSEQ ID NO: 1 (encoding SARS-CoV S, SEQ ID 
NO: 4), nucleic acid 25,268 to 26,089 of SEQ ID NO: 1 
(encoding SARS-CoV XI, SEQ ID NO: 5), nucleic acid 
25,689 to 26,150 ofSEQ ID NO: 1 (encoding SARS-CoV 
X2, SEQ ID NO: 6 ), nucleic acid 26,117 to 26,344 ofSEQ 
ID NO: 1 (encoding SARS-CoV E, SEQ ID NO: 7), nucleic 
acid 26,398 to 27,060 of SEQ ID NO: 1 (encoding SARS- 
CoV M, SEQ ID NO: 8 ), nucleic acid 27,074 to 27,262 of 
SEQ ID NO: 1 (encoding SARS-CoV X3, SEQ ID NO: 9), 
nucleic acid 27,273 to 27,638 of SEQ ID NO: 1 (encoding 
SARS-CoV X4, SEQ ID NO: 10), nucleic acid 27,864 to 
28,115 ofSEQ ID NO: 1 (encoding SARS-CoV X5, SEQ ID 
NO: 11), and nucleic acid 28,120 to 29,385 ofSEQ ID NO: 
1 (encoding SARS-CoV N, SEQ ID NO: 12). 

Oligonucleotide primers and probes derived from the 
SARS-CoV genome (SEQ ID NO: 1) are also encompassed 
within the scope of the present disclosure. Such oligonucle¬ 
otide primers and probes may comprise a sequence of at 
least about 15 consecutive nucleotides of the SARS-CoV 
nucleic acid sequence, such as at least about 20, 25, 30, 35, 
40,45, or 50 or more consecutive nucleotides. These primers 
and probes may be obtained from any region of the disclosed 
SARS-CoV genome (SEQ ID NO: 1), including particularly 
from any of the ORFs disclosed herein. Specific, non- 
limiting examples of oligonucleotide primers derived from 
the SARS-CoV genome (SEQ ID NO: 1) include: Cor-p-F2 
(SEQ ID NO: 13), Cor-p-F3 (SEQ ID NO: 14), Cor-p-Rl 
(SEQ ID NO: 15), SARS1-F (SEQ ID NO: 16), SARS1-R 
(SEQ ID NO: 17), SARS2-F (SEQ ID NO: 19), SARS2-R 
(SEQ ID NO: 20), SARS3-F (SEQ ID NO: 22), SARS3-R 
(SEQ ID NO: 23), N3-F (SEQ ID NO: 25), N3-R (SEQ ID 
NO: 26), 3'NTR-F (SEQ ID NO: 28), 3'NTR-R (SEQ ID 
NO: 29), M-F (SEQ ID NO: 31), and M-R (SEQ ID NO: 32). 
Specific, non-limiting examples of oligonucleotide probes 
derived from the SARS-CoV genome (SEQ ID NO: 1) 
include: SARS1-P (SEQ ID NO: 18), SARS2-P (SEQ ID 
NO: 21), SARS3-P (SEQ ID NO: 24), N3-P (SEQ ID NO: 
27), 3'NTR-P (SEQ ID NO: 30), and M-P (SEQ ID NO: 33). 

Nucleic acid molecules encoding a SARS-CoV polypep¬ 
tide can be operatively linked to regulatory sequences or 
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elements. Regulatory sequences or elements include, but are 
not limited to Promoters, enhancers, transcription termina- 
tors, a Start codon (for example, ATG), stop codons, and the 
like. 

5 Additionally, nucleic acid molecules encoding a SARS- 
CoV polypeptide can be inserted into an expression vector. 
Specific, non-limiting examples of vectors include, plas- 
mids, bacteriophages, cosmids, animal viruses and yeast 
artificial chromosomes (YACs) (Burke et al., Science 236: 
806-12, 1987). Such vectors may then be introduced into a 
variety of hosts including somatic cells, and simple or 
complex organisms, such as bacteria, fungi (Timberlake and 
Marshall, Science 244:1313-17, 1989), invertebrates, plants 
(Gasser et al., Plant Cell 1:15-24, 1989), and animals 
(Pursei et al., Science 244:1281-88, 1989). 

15 Transformation of a host cell with an expression vector 
carrying a nucleic acid molecule encoding a SARS-CoV 
polypeptide may be carried out by conventional techniques, 
as are well known to those skilled in the art. By way of 
example, where the host is prokaryotic, such as E. coli, 
20 competent cells that are capable of DNA uptake can be 
prepared from cells harvested after exponential growth 
phase and subsequently treated by the CaCl 2 method using 
procedures well known in the art. Altematively, MgCl, or 
RbCl can be used. Transformation can also be performed 
25 after forming a protoplast of the host cell if desired, or by 
electroporation. 

When the host is a eukaryote, methods of transfection of 
DNA, such as calcium phosphate coprecipitates, conven¬ 
tional mechanical procedures such as microinjection, elec- 
30 troporation, insertion of a plasmid encased in liposomes, or 
virus vectors, may be used. Eukaryotic cells can also be 
cotransformed with SARS-CoV nucleic acid molecules, and 
a second foreign DNA molecule encoding a selectable 
phenotype, such as the herpes simplex thymidine kinase 
gene. Another method is to use a eukaryotic viral vector, 
35 such as simian virus 40 or bovine papilloma virus, to 
transiently infect or transform eukaryotic cells and express 
the protein (see, for example, Eukaryotic Viral Vectors, Cold 
Spring Harbor Laboratory, Gluzman ed., 1982). 

The SARS-CoV polypeptides of this disclosure include 
40 proteins encoded by any of the ORFs disclosed herein, and 
equivalents thereof. Specific, non-limiting examples of 
SARS-CoV proteins are provided in SEQ ID NOs: 2-12. 
Fusion proteins are also contemplated that include a heter- 
ologous amino acid sequence chemically linked to a SARS- 
45 CoV polypeptide. Exemplary heterologous sequences 
include short amino acid sequence tags (such as six histidine 
residues), as well a fusion of other proteins (such as c-myc 
or green fluorescent protein fusions). Epitopes of the SARS- 
CoV proteins, that are recognized by an antibody or that bind 
50 the major histocompatibility complex, and can be used to 
induce a SARS-CoV-specific immune response, are also 
encompassed by this disclosure. 

Methods for expressing large amounts of protein from a 
cloned gene introduced into E. coli may be utilized for the 
.. purification and functional analysis of proteins. For 
example, fusion proteins consisting of amino terminal pep- 
tides encoded by a portion of the E. coli lacZ or trpE gene 
linked to SARS-CoV proteins may be used to prepare 
polyclonal and monoclonal antibodies against these pro¬ 
teins. 

60 Intact native protein may also be produced in E. coli in 
large amounts for functional studies. Methods and plasmid 
vectors for producing fusion proteins and intact native 
proteins in bacteria are described by Sambrook et al. (ed.), 
Molecular Cloning: A Laboratory Manual, 2” </ ed., vol. 1-3, 
65 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
N.Y., 1989. Such fusion proteins may be made in large 
amounts, are easy to purify, and can be used to elicit 
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antibody response. Native proteins can be produced in 
bacteria by placing a strong, regulated promoter and an 
efiScient ribosome-binding site upstream of the cloned gene. 

If low levels of protein are produced, additional Steps may 
be taken to increase protein production; if high levels of 5 
protein are produced, purification is relatively easy. Suitable 
methods are presented by Sambrook et al. (ed.), Molecular 
Cloning: A Laboratory Manual, 2P d ed., vol. 1-3, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
1989, and are well known in the art. Offen, proteins 
expressed at high levels are found in insoluble inclusion 
bodies. Methods for extracting proteins from these aggre- 
gates are described by Sambrook et al. (ed.), Molecular 
Cloning: A Laboratory Manual, 2" d ed., vol. 1-3, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
1989. 15 

Isolation and purification of recombinantly expressed 
proteins may be carried out by conventional means includ- 
ing preparative chromatography and immunological sepa- 
rations. Additionally, the proteins can be chemically synthe- 
sized by any of a number of manual or automated methods 20 
of synthesis known in the art. 

V. Specific Binding Agents 

The disclosure provides specific binding agents that bind 
to SARS-CoV polypeptides disclosed herein. The binding 2 5 
agent may be useful for purifying and detecting the polypep¬ 
tides, as well as for detection and diagnosis of SARS-CoV. 
Examples of the binding agents are a polyclonal or mono¬ 
clonal antibody, and fragments thereof, that bind to any of 
the SARS-CoV polypeptides disclosed herein. 

Monoclonal or polyclonal antibodies may be raised to 
recognize a SARS-CoV polypeptide described herein, or a 
fragment or variant thereof. Optimally, antibodies raised 
against these polypeptides would specifically detect the 
polypeptide with which the antibodies are generated. That is, 
antibodies raised against a specific SARS-CoV polypeptide 35 
will recognize and bind that polypeptide, and will not 
substantially recognize or bind to other polypeptides or 
antigens. The determination that an antibody specifically 
binds to a target polypeptide is made by any one of a number 
of Standard immunoassay methods; for instance, the Western 40 
blotting technique (Sambrook et al. (ed.), Molecular Clon¬ 
ing: A Laboratory Manual, 2 nd ed., vol. 1-3, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989). 

Substantially pure SARS-CoV recombinant polypeptide 
antigens suitable for use as immunogen may be isolated 45 
from the transformed cells described above, using methods 
well known in the art. Monoclonal or polyclonal antibodies 
to the antigens may then be prepared. 

Monoclonal antibodies to the polypeptides can be pre¬ 
pared from murine hybridomas according to the classic 50 
method of Kollier & Milstein (Nature 256:495-97, 1975), or 
a derivative method thereof. Briefly, a mouse is repetitively 
inoculated with a few micrograms of the selected protein 
immunogen (for example, a polypeptide comprising at least 
one SARS-CoV-specific epitope, a portion of a polypeptide .. 
comprising at least one SARS-CoV-specific epitope, or a 
synthetic peptide comprising at least one SARS-CoV-spe- 
cific epitope) over a period of a few weeks. The mouse is 
then sacrificed, and the antibody-producing cells of the 
spieen isolated. The spieen cells are fused by means of 
polyethylene glycol with mouse myeloma cells, and the 60 
excess unfused cells destroyed by growth of the System on 
selective media comprising aminopterin (HAT media). The 
successfully fused cells are diluted and aliquots of the 
dilution placed in wells of a microtiter plate where growth 
of the culture is continued. Antibody-producing clones are 65 
identified by detection of antibody in the supematant fluid of 
the wells by immunoassay procedures, such as ELISA, as 
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originally described by Engvall ( Meth. Enzymol., 
70:419-39, 1980), or a derivative method thereof. Selected 
positive clones can be expanded and their monoclonal 
antibody product harvested for use. Detailed procedures for 
monoclonal antibody production are described in Harlow 
and Laue, Using Antibodies: A Laboratory’ Manual , CSHL, 
New York, 1999. 

Polyclonal antiserum containing antibodies can be pre¬ 
pared by immunizing suitable animals with a polypeptide 
comprising at least one SARS-CoV-specific epitope, a por¬ 
tion of a polypeptide comprising at least one SARS-CoV- 
specific epitope, or a synthetic peptide comprising at least 
one SARS-CoV-specific epitope, which can be unmodified 
or modified, to enhance immunogenicity. 

Effective antibody production (whether monoclonal or 
polyclonal) is affected by many factors related both to the 
antigen and the host species. For example, small molecules 
tend to be less immunogenic than others and may require the 
use of carriers and adjuvant. Also, host animals vary in 
response to site of inoculations and dose, with either inad¬ 
equate or excessive doses of antigen resulting in low titer 
antisera. Small doses (ng level) of antigen administered at 
multiple intradermal sites appear to be most reliable. An 
effective immunization protocol for rabbits can be found in 
Vaitukaitis et al. ( J. Clin. Endocrinol. Metab.. 33:988-91, 
1971). 

Booster injections can be given at regulär intervals, and 
antiserum harvested when the antibody titer thereof, as 
determined semi-quantitatively, for example, by double 
immunodiflusion in agar against known concentrations of 
the antigen, begins to fall. See, for example, Ouchterlony et 
al., Handbook of Experimental Immunology , Wier, D. (ed.), 
Chapter 19, Blackwell, 1973. A plateau concentration of 
antibody is usually in the ränge of 0.1 to 0.2 mg/ml of serum 
(about 12 pM). Aiiinity of the antisera for the antigen is 
determined by preparing competitive binding curves, as 
described, for example, by Fisher (Manual of Clinical 
Immunology\ Ch. 42, 1980). 

.Antibody fragments may be used in place of whole 
antibodies and may be readily expressed in prokaryotic host 
cells. Methods of making and using immunologically effec- 
tive portions of monoclonal antibodies, also referred to as 
“antibody fragments,” are well known and include those 
described in Better & Horowitz, Methods Enzymol. 178: 
476-96, 1989; Glockshuber et al., Biochemistry 

29:1362-67, 1990; andU.S. Pat. No. 5,648,237 (Expression 
of Functional .Antibody Fragments); U.S. Pat. No. 4,946,778 
(Single Polypeptide Chain Binding Molecules); and U.S. 
Pat. No. 5,455,030 (Immunotherapy Using Single Chain 
Polypeptide Binding Molecules), and references cited 
therein. Conditions whereby a polypeptide/binding agent 
complex can form, as well as assays for the detection of the 
formation of a polypeptide/binding agent complex and quan- 
titation of binding afünities of the binding agent and 
polypeptide, are Standard in the art. Such assays can include, 
but are not limited to, Western blotting, immunoprecipita- 
tion, immmiofluorescence, immunocytochemistry, immuno- 
histochemistry, fluorescence activated cell sorting (FACS), 
fluorescence in situ hybridization (FISH), immunomagnetic 
assays, ELISA, ELISPOT (Coligan et al., Current Protocols 
in Immunology, Wiley, NY, 1995), agglutination assays, 
flocculation assays, cell panning, and the like, as are well 
known to one of skill in the art. 

Binding agents of this disclosure can be bound to a 
Substrate (for example, beads, tubes, slides, plates, nitrocel- 
lulose sheets, and the like) or conjugated with a detectable 
moiety, or both bound and conjugated. The detectable moi- 
eties contemplated for the present disclosure can include, 
but are not limited to, an immunofluorescent moiety (for 
example, fluorescein, rhodamine), a radioactive moiety (for 
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example, 32 P, 125 I, 35 S), an enzyme moiety (for example, 
horseradish peroxidase, alkaline phosphatase), a colloidal 
gold moiety, and a biotin moiety. Such conjugation tech- 
niques are Standard in the art (for example, see Harlow and 
Lane, Using Antibodies: A Laboratory Manual, CSHL, New 
York, 1999; Yang et al, Nature, 382:319-24, 1996). 

VI. Detection and Diagnosis of SARS-CoV 

A. Nucleic Acid Based Methods of Detection and Diagnosis 

A major application of the SARS-CoV sequence infor- 
mation presented herein is in the area of detection and 
diagnostic testing for SARS-CoV infection. Methods for 
Screening a subject to detennine if the subject has been or is 
currently infected with SARS-CoV are disclosed herein. 

One such method includes providing a sample, which 
sample includes a nucleic acid such as DNA or RNA, and 
providing an assay for detecting in the sample the presence 
of a SARS-CoV nucleic acid molecule. Suitable samples 
include all biological samples useful for detection of viral 
infection in subjects, including, but not limited to, cells, 
tissues (for example, lung and kidney), bodily fluids (for 
example, blood, serum, urine, saliva, sputum, and cere¬ 
brospinal fluid), bone marrow aspirates, BAL, and oropha¬ 
ryngeal wash. Additional suitable samples include all envi¬ 
ronmental samples useful for detection of viral presence in 
the environment, including, but not limited to, a sample 
obtained from inanimate objects or reservoirs within an 
indoor or outdoor environment. The detection in the sample 
of a SARS-CoV nucleic acid molecule may be perfonned by 
a nurnber of methodologies, non-limiting examples of which 
are outlined below. 

In one embodiment, detecting in the sample the presence 
of a SARS-CoV nucleic acid molecule includes the ampli- 
fication of a SARS-CoV nucleic acid sequence (or a frag- 
ment thereof). Any nucleic acid amplification method can be 
used. In one specific, non-limiting example, PCR is used to 
amplify the SARS-CoV nucleic acid sequence(s). In another 
non-limiting example, RT-PCR can be used to amplify the 
SARS-CoV nucleic acid sequences. In an additional non- 
limiting example, transcription-mediated amplification can 
be used to amplify the SARS-CoV nucleic acid sequences. 

In some embodiments, a pair of SARS-CoV-specific 
primers are utilized in the amplification reaction. One or 
both of the primers can be end-labeled (for example, radio- 
labeled, fluoresceinated, or biotinylated). In one specific, 
non-limiting example, at least one of the primers is 5'-end 
labeled with the reporter dye 6 -carboxyfluorescein ( 6 -FAM). 
The pair of primers includes an upstream primer (which 
binds 5' to the downstream primer) and a downstream primer 
(which binds 3' to the upstream primer). In one embodiment, 
either the upstream primer or the downstream primer is 
labeled. Specific, non-limiting examples of SARS-CoV- 
specific primers include, but are not limited to: Cor-p-F2 
(SEQ ID NO: 13), Cor-p-F3 (SEQ ID NO: 14), Cor-p-Rl 

(SEQ ID NO: 15), SARS1-F (SEQ ID NO: 16), SARS1-R 

(SEQ ID NO: 17), SARS2-F (SEQ ID NO: 19), SARS2-R 

(SEQ ID NO: 20), SARS3-F (SEQ ID NO: 22), SARS3-R 

(SEQ ID NO: 23), N3-F (SEQ ID NO: 25), N3-R (SEQ ID 

NO: 26), 3'NTR-F (SEQ ID NO: 28), 3'NTR-R (SEQ ID 
NO: 29), M-F (SEQ ID NO: 31), and M-R (SEQ ID NO: 32). 
Additional primer pairs can be generated, for instance, to 
amplify any of the specific ORFs described herein, using 
well known primer design principles and methods. 

In one specific, non-limiting example, electrophoresis is 
used to detect amplified SARS-CoV-specific sequences. 
Electrophoresis can be automated using many methods well 
know in the art. In one embodiment, a genetic analyzer is 
used, such as an ABI 3100 Prism Genetic Analyzer (PE 
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Applied Biosystems, Foster City, Calif.), wherein the bands 
are analyzed using GeneScan Software (PE Applied Biosys¬ 
tems, Foster City, Calif.). 

In another specific, non-limiting example, hybridization 
5 assays are used to detect amplified SARS-CoV-specific 
sequences using distinguishing oligonucleotide probes. 
Such probes include “TaqMan” probes. TaqMan probes 
consist of an oligonucleotide with a reporter at the 5'-end and 
a quencher at the 3'-end. In one specific, non-limiting 
example, the reporter is 6 -FAM and the quencher is Black¬ 
hole Quencher (BiosearchTech., Inc., Novato, Calif.). Wlien 
the probe is intact, the proximity of the reporter to the 
quencher results in suppression of reporter fluorescence, 
primarily by fluorescence resonance energy transfer. If the 
target of interest is present, the TaqMan probe specifically 
15 hybridizes between the forward and reverse primer sites 
during the PCR annealing step. In the process of PCR 
elongation, the 5'-3' nucleolytic activity of the Taq DNA 
polymerase cleaves the hybridized probe between the 
reporter and the quencher. The probe fragments are then 
20 displaced from the target, and polymerization of the Strand 
continues. Taq DNA polymerase does not cleave non-hy- 
bridized probe, and cleaves the hybridized probe only during 
polymerization. The 3'-end of the probe is blocked to 
prevent extension of the probe during PCR. The 5'-3' 
25 nuclease cleavage of the hybridized probe occurs in every 
cycle and does not interfere with the exponential accumu- 
lation of PCR product. Accumulation of PCR products is 
detected directly by monitoring the increase in fluorescence 
of the released reporter. The increase in fluorescence signal 
is detected only if the target sequence is complementary to 
the probe and is amplified during PCR. Therefore, non- 
specific amplification is not detected. SARS-CoV-specific 
TaqMan probes of the present disclosure include, but are not 
limited to: SARS1-P (SEQ ID NO: 18), SARS2-P (SEQ ID 
NO: 21), SARS3-P (SEQ ID NO: 24), N3-P (SEQ ID NO: 
35 27), 3'NTR-P (SEQ ID NO: 30), and M-P (SEQ ID NO: 33), 
and hybridization assays include, but are not limited to, a 
real-time RT-PCR assay. 

B. Protein Based Methods of Detection and Diagnosis 
The present disclosure further provides methods of detect - 
411 ing a SARS-CoV antigen in a sample, and/or diagnosing 
SARS-CoV infection in a subject by detecting a SARS-CoV 
antigen. Examples of such methods comprise contacting the 
sample with a SARS-CoV-specific binding agent under 
conditions whereby an antigen/binding agent complex can 
45 form; and detecting formation of the complex, thereby 
detecting SARS-CoV antigen in a sample and/or diagnosing 
SARS-CoV infection in a subject. It is contemplated that at 
least certain antigens will be on an intact SARS-CoV virion, 
will be a SARS-CoV-encoded protein displayed on the 
50 surface of a SARS-CoV-infected cell expressing the antigen, 
or will be a Fragment of the antigen. Contemplated samples 
subject to analysis by these methods can comprise any 
sample, such as a clinical sample, useful for detection of 
viral infection in a subject. 

55 Methods for detecting antigens in a sample are discussed, 
for example, in Ausubel et al. Short Protocols in Molecular 
Biology, 4' 1 '' ed., John Wiley & Sons, Inc., 1999. Enzyme 
immunoassays such as IFA, ELISA and immunoblotting can 
be readily adapted to accomplish the detection of SARS- 
CoV antigens according to the methods of this disclosure. 
60 An ELISA method eflective for the detection of soluble 
SARS-CoV antigens is the direct competitive ELISA. This 
method is most useful when a specific SARS-CoV antibody 
and purified SARS-CoV antigen are available. Briefly: 1) 
coat a Substrate (for example, a microtiter plate) with a 
65 sample suspected of containing a SARS-CoV antigen; 2) 
contact the boimd SARS-CoV antigen with a SARS-CoV- 
specific antibody bound to a detectable moiety (for example, 
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horseradish peroxidase enzyme or alkaline phosphatase 
enzyme); 3) add purified inhibitor SARS-CoV antigen; 4) 
contact the above with the Substrate for the enzyme; and 5) 
observe/measure inhibition of color change or fluorescence 
and quantitate antigen concentration (for example, using a 5 
microtiter plate reader). 

An additional ELISAmethod effective for the detection of 
soluble SARS-CoV antigens is the antibody-sandwich 
ELISA. This method is frequently more sensitive in detect- 
ing antigen than the direct competitive ELISA method. 
Briefly: 1) coat a Substrate (for example, a microtiter plate) 
with a SARS-CoV-specific antibody; 2) contact the bound 
SARS-CoV antibody with a sample suspected of containing 
a SARS-CoV antigen; 3) contact the above with SARS- 
CoV-specific antibody bound to a detectable moiety (for 
example, horseradish peroxidase enzyme or alkaline phos- 15 
phatase enzyme); 4) contact the above with the Substrate for 
the enzyme; and 5) observe/measure color change or fluo¬ 
rescence and quantitate antigen concentration (for example, 
using a microtiter plate reader). 

An ELISA method effective for the detection of cell- 20 
surface SARS-CoV antigens is the direct cellular ELISA. 
Briefly, cells suspected of exhibiting a cell-surface SARS- 
CoV antigen are fixed (for example, using glutaraldehyde) 
and incubated with a SARS-CoV-specific antibody bound to 
a detectable moiety (for example, horseradish peroxidase 25 
enzyme or alkaline phosphatase enzyme). Following a wash 
to remove unbound antibody, Substrate for the enzyme is 
added and color change or fluorescence is observed/mea- 
sured. 

The present disclosure further provides methods of detect- 30 
ing a SARS-CoV-reactive antibody in a sample, and/or 
diagnosing SARS-CoV infection in a subject by detecting a 
SARS-CoV-reactive antibody. Examples of such methods 
comprise contacting the sample with a SARS-CoV polypep- 
tide of this disclosure under conditions whereby a polypep- „ 
tide/antibody coinplex can form; and detecting formation of 
the complex, thereby detecting SARS-CoV antibody in a 
sample and/or diagnosing SARS-CoV infection in a subject. 
Contemplated samples subject to analysis by these methods 
can comprise any sample, such as a clinical sample, as 
described herein as being useful for detection of viral 40 
infection in a subject. 

Methods for detecting antibodies in a sample are dis- 
cussed, for example, in Ausubel et al. Short Protocols in 
Molecular Biology, 4 th ed., John Wiley & Sons, Inc., 1999. 
Enzyme immunoassays such as IFA, ELISA and immuno- 45 
blotting can be readily adapted to accomplish the detection 
of SARS-CoV antibodies according to the methods of this 
disclosure. An ELISA method eflective for the detection of 
specific SARS-CoV antibodies is the indirect ELISA 
method. Briefly: 1) bind a SARS-CoV polypeptide to a 50 
Substrate (for example, a microtiter plate; 2) contact the 
bound polypeptide with a sample suspected of containing 
SARS-CoV antibody; 3) contact the above with a secondary 
antibody bound to a detectable moiety which is reactive with 
the bound antibody (for example, horseradish peroxidase 5 _ 
enzyme or alkaline phosphatase enzyme); 4) contact the 
above with the Substrate for the enzyme; and 5) observe/ 
measure color change or fluorescence. 

Another immunologic technique that can be useful in the 
detection of SARS-CoV antibodies uses monoclonal anti¬ 
bodies for detection of antibodies specifically reactive with 60 
SARS-CoV polypeptides in a competitive inhibition assay. 
Briefly, a sample suspected of containing SARS-CoV anti¬ 
bodies is contacted with a SARS-CoV polypeptide of this 
disclosure which is bound to a Substrate (for example, a 
microtiter plate). Excess sample is thoroughly washed away. 65 
A labeled (for example, enzyme-linked, fluorescent, radio- 
active, and the like) monoclonal antibody specific for the 


24 

SARS-CoV polypeptide is then contacted with any previ- 
ously fonned polypeptide-antibody complexes and the 
amount of monoclonal antibody binding is measured. The 
amount of inhibition of monoclonal antibody binding is 
measured relative to a control (110 monoclonal antibody), 
allowing for detection and measurement of antibody in the 
sample. The degree of monoclonal antibody inhibition can 
be a very specific assay for detecting SARS-CoV. Mono¬ 
clonal antibodies can also be used for direct detection of 
SARS-CoV in cells or tissue samples by, for example, IFA 
analysis according to Standard methods. 

As a further example, a micro-agglutination test can be 
used to detect the presence of SARS-CoV antibodies in a 
sample. Briefly, latex beads, red blood cells or other agglu- 
tinable particles are coated with a SARS-CoV polypeptide 
of this disclosure and mixed with a sample, such that 
antibodies in the sample that are specifically reactive with 
the antigen crosslink with the antigen, causing agglutination. 
The agglutinated polypeptide-antibody complexes form a 
precipitate, visible with the naked eye or measurable by 
spectrophotometer. In a modification of the above test, 
SARS-CoV-specific antibodies of this disclosure can be 
bound to the agglutinable particles and SARS-CoV antigen 
in the sample thereby detected. 

VII. Phannaceutical and Immune Stimulatory Compositions 
and Uses Thereof 

Phannaceutical compositions including SARS-CoV 
nucleic acid sequences, SARS-CoV polypeptides, or anti¬ 
bodies that bind these polypeptides, are also encompassed 
by the present disclosure. These phannaceutical composi¬ 
tions include a therapeutically effective amount of one or 
more SARS-CoV polypeptides, one or more nucleic acid 
molecules encoding a SARS-CoV polypeptide, or an anti¬ 
body that binds a SARS-CoV polypeptide, in conjunction 
with a pharmaceutically acceptable carrier. 

Disclosed herein are substances suitable for use as 
immune stimulatory compositions for the inhibition or treat- 
ment of SARS. Particular immune stimulatory compositions 
are directed against SARS-CoV, and include antigens 
obtained from SARS-CoV. In one embodiment, an immune 
stimulatory composition contains attenuated SARS-CoV. 
Methods of viral attenuation are well know in the art, and 
include, but are not limited to, high serial passage (for 
example, in susceptible host cells under specific environ¬ 
mental conditions to select for attenuated virions), exposure 
to a mutagenic agent (for example, a Chemical mutagen or 
radiation), genetic engineering using recombinant DNA 
technology (for example, using gene replacement or gene 
knockout to disable one or more viral genes), or some 
combination thereof. 

In another embodiment, the immune stimulatory compo¬ 
sition contains inactivated SARS-CoV. Methods of viral 
inactivation are well known in the art, and include, but are 
not limited to, heat and Chemicals (for example, formalin, 
ß-propiolactone, and ehtylenimines). 

In yet another embodiment, the i mmu ne stimulatory com¬ 
position contains a nucleic acid vector that includes SARS- 
CoV nucleic acid molecules described herein, or that 
includes a nucleic acid sequence encoding an immunogenic 
polypeptide or polypeptide fragment of SARS-CoV or 
derived ffoni SARS-CoV, such as a polypeptide that encodes 
a surface protein of SARS-CoV. 

In a further embodiment, the i mmu ne stimulatory com¬ 
position contains a SARS-CoV subunit, such as glycopro- 
tein, major capsid protein, or other gene products found to 
elicit humoral and/or cell mediated immune responses. 

The provided immune stimulatory SARS-CoV polypep¬ 
tides, constracts or vectors encoding such polypeptides, are 
combined with a pharmaceutically acceptable carrier or 
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vehicle for adminiStation as an immune stimulatory com- 
position to human or animal subjects. In some embodiments, 
more than one immune stimulatory SARS-CoV polypeptide 
may be combined to form a single preparation. 

Hie immunogenic formulations may be conveniently pre- 
sented in unit dosage fonn and prepared using conventional 
pharmaceutical techniques. Such techniques include the Step 
of bringing into association the active ingredient and the 
pharmaceutical carrier(s) or excipient(s). In general, the 
formulations are prepared by unifonnly and intimately 
bringing into association the active ingredient with liquid 
carriers. Formulations suitable for parenteral administration 
include aqueous and non-aqueous sterile injection Solutions 
which may contain anti-oxidants, buffers, bacteriostats and 
solutes which render the formulation isotonic with the blood 
of the intended recipient; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents 
and thickening agents. The formulations may be presented in 
unit-dose or multi-dose Containers, for example, sealed 
ampules and vials, and may be stored in a freeze-dried 
(lyophilized) condition requiring only the addition of a 
sterile liquid carrier, for example, water for injections, 
immediately prior to use. Extemporaneous injection Solu¬ 
tions and suspensions may be prepared front sterile powders, 
granules and tablets commonly used by one of ordinary skill 
in the art. 

In certain embodiments, unit dosage formulations are 
those containing a dose or unit, or an appropriate fraction 
thereof, of the administered ingredient. It should be under- 
stood that in addition to the ingredients particularly men- 
tioned above, formulations encompassed herein may include 
other agents commonly used by one of ordinary skill in the 
art. 

The compositions provided herein, including those for use 
as immune stimulatory compositions, may be administered 
through different routes, such as oral, including buccal and 
sublingual, rectal, parenteral, aerosol, nasal, intramuscular, 
subcutaneous, intradermal, and topical. They may be admin¬ 
istered in different forms, including but not limited to 
Solutions, emulsions and suspensions, microspheres, par- 
ticles, microparticles, nanoparticles, and liposomes. 

The volume of administration will vary depending on the 
route of administration. By way of example, intramuscular 
injections may ränge from about 0.1 ml to about 1.0 ml. 
Those of ordinary skill in the art will know appropriate 
volumes for different routes of administration. 

A relatively recent development in the field of immune 
stimulatory compounds (for example, vaccines) is the direct 
injection of nucleic acid molecules encoding peptide anti- 
gens (broadly described in Janeway & Travers, Immunobi- 
ology: The Immune System In Health and Disease, page 
13.25, Garland Publishing, Inc., New York, 1997; and 
McDonnell & Askari, N. Engl. J. Med. 334:42-45, 1996). 
Vectors that include nucleic acid molecules described 
herein, or that include a nucleic acid sequence encoding an 
immunogenic SARS-CoV polypeptide may be utilized in 
such DNA vaccination methods. 

Thus, the term “immune stimulatory composition” as 
used herein also includes nucleic acid vaccines in which a 
nucleic acid molecule encoding a SARS-CoV polypeptide is 
administered to a subject in a pharmaceutical composition. 
For genetic immunization, suitable delivery methods known 
to those skilled in the art include direct injection of plasmid 
DNA into muscles (Wollf et al., Hum. Mol. Genet. 1:363, 
1992), delivery of DNA complexed with specific protein 
carriers (Wu et al., J. Biol. Chem. 264:16985, 1989), co- 
precipitation of DNA with calcium phosphate (Benvenisty 
and Reshef, Proc. Natl. Acad. Sei. 83:9551, 1986), encap- 
sulation of DNA in liposomes (Kaneda et al., Science 
243:375, 1989), particle bombardment (Tang et al., Nature 
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356:152, 1992; Eisenbraun et al., DNA Cell Biol. 12:791, 
1993), and in vivo infection using cloned retroviral vectors 
(Seeger et al., Proc. Natl. Acad. Sei. 81:5849, 1984). Simi- 
larly, nucleic acid vaccine preparations can be administered 
5 via viral carrier. 

The amount of immunostimulatory compound in each 
dose of an immune stimulatory composition is selected as an 
amount that induces an immunostimulatory or innnunopro- 
tective response without significant, adverse side effects. 
I0 Such amount will vary depending upon which specific 
immunogen is employed and how it is presented. Initial 
injections may ränge from about 1 pg to about 1 mg, with 
some embodiments having a ränge of about 10 pg to about 
800 pg, and still other embodiments a ränge of from about 
25 pg to about 500 pg. Following an initial administration of 
15 the immune stimulatory composition, subjects may receive 
one or several booster administrations, adequately spaced. 
Booster administrations may ränge from about 1 pg to about 
1 mg, with other embodiments having a ränge of about 10 
pg to about 750 pg, and still others a ränge of about 50 pg 
20 to about 500 pg. Periodic boosters at intervals of 1-5 years, 
for instance three years, may be desirable to maintain the 
desired levels of protective immunity. 

It is also contemplated that the provided immunostimu¬ 
latory molecules and compositions can be administered to a 
25 subject indirectly, by first stimulating a cell in vitro, which 
stimulated cell is thereafter administered to the subject to 
elicit an immune response. Additionally, the pharmaceutical 
or immune stimulatory compositions or methods of treat- 
ment may be administered in combination with other thera- 
30 peutic treatments. 

VIII. Kits 

Also provided herein are kits useful in the detection 
and/or diagnosis of SARS-CoV. This includes kits for use 
with nucleic acid and protein detection methods, such as 
35 those disclosed herein. 

The SARS-CoV-specific oligonucleotide primers and 
probes described herein can be supplied in the form of a kit 
for use in detection of SARS-CoV. In such a kit, an appro¬ 
priate amount of one or more of the oligonucleotides is 
40 provided in one or more Containers, or held on a Substrate. 
An oligonucleotide primer or probe can be provided in an 
aqueous solution or as a freeze-dried or lyophilized powder, 
for instance. The container(s) in which the 
oligonucleotide(s) are supplied can be any conventional 
45 Container that is capable of holding the supplied form, for 
instance, microfuge tubes, ampoules, or bottles. In some 
applications, pairs of primers are provided in pre-measured 
single use amounts in individual (typically disposable) tubes 
or equivalent Containers. With such an arrangement, the 
5 q sample to be tested for the presence of a SARS-CoV nucleic 
acid can be added to the individual tubes and amplification 
carried out directly. 

The amount of each oligonucleotide supplied in the kit 
can be any appropriate amount, and can depend on the 
.. market to which the product is directed. For instance, if the 
kit is adapted for research or clinical use, the amount of each 
oligonucleotide primer provided would likely be an amount 
sufScient to prime several PCR amplification reactions. 
General guidelines for determining appropriate amounts can 
be found, for example, in Sambrook et al., Molecular 
60 Cloning: A Laboratory Manual , Cold Spring Harbor Labo- 
ratory Press, Cold Spring Harbor, N.Y., 2001; Ausubel et al. 
(eds.), Short Protocols in Molecular Biology, John Wiley 
and Sons, New York, N.Y., 1999; and Innis et al., PCR 
Applications, Protocols for Functional Genomics, Academic 
65 Press, Inc., San Diego, Califi, 1999. Akit can include more 
than two primers, in order to facilitate the amplification of 
a larger nmnber of SARS-CoV nucleotide sequences. 
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In some embodiments, kits also include one or more 
reagents necessary to carry out in vitro amplification reac- 
tions, including DNA sample preparation reagents, appro- 
priate bulfers (for example, polymerase buffer), salts (for 
example, magnesium Chloride), and deoxyribonucleotides 
(dNTPs). 

Kits can include either labeled or imlabeled oligonucle- 
otide primers and/or probes for use in detection of SARS- 
CoV nucleotide sequences. The appropriate sequences for 
such a probe will be any sequence that falls between the 
annealing sites of the two provided oligonucleotide primers, 
such that the sequence that the probe is complementary to is 
amplified during the amplification reaction. 

One or more control sequences for use in the amplifica¬ 
tion reactions also can be supplied in the kit. In other 
particular embodiments, the kit includes equipment, 
reagents, and instructions for extracting and/or purifying 
nucleotides from a sample. 

Kits for the detection of SARS-CoV antigen include for 
instance at least one SARS-CoV antigen-specific binding 
agent (for example, a polyclonal or monoclonal antibody or 
antibody fragment). The kits may also include means for 
detecting antigen: specific binding agent complexes, for 
instance the specific binding agent may be detectably 
labeled. If the specific binding agent is not labeled, it may be 
detected by second antibodies or protein A, for example, 
which may also be provided in some kits in one or more 
separate Containers. Such techniques are well known. 

Another example of an assay kit provided herein is a 
recombinant SARS-CoV-specific polypeptide (or fragment 
thereof) as an antigen and an enzyme-conjugated anti¬ 
human antibody as a second antibody. Examples of such kits 
also can include one or more enzymatic Substrates. Such kits 
can be used to test if a sample from a subject contains 
antibodies against a SARS-CoV-specific protein. 

The subject matter of the present disclosure is llirther 
illustrated by the following non-limiting Examples. 
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EXAMPLES 
Example 1 

Isolation and Characterization of SARS-CoV 


Virus Isolation and Ultrastractural Characterization 

This example describes the original isolation and cliarac- 
to terization of a new human Coronavirus from patients with 
SARS. 

A variety of clinical specimens (blood, serum, material 
from oropharyngeal swabs or washings, material from 
nasopharyngeal swabs, and tissues of major Organs collected 
15 at autopsy) from patients meeting the case definition of 
SARS were sent to the Centers for Disease Control and 
Prevention (CDC) as part of the etiologic investigation of 
SARS. These samples were inoculated onto a number of 
continuous cell lines, including Vero E6, NCI-H292, 
2 0 MDCK, LLC-MK2, and B95-8 cells, and into suckling ICR 
mice by the intracranial and intraperitoneal routes. All 
cultures were observed daily for CPE. Maintenance medium 
was replenished at day seven, and cultures were terminated 
fourteen days after inoculation. Any cultures exhibiting 
identifiable CPE were subjected to several procedures to 
identify the cause of the effect. Suckling mice were observed 
daily for fourteen days, and any sick or dead mice were 
further tested by preparing a brain Suspension that was 
filtered and subcultured. Mice that remained well after 
fourteen days were killed, and their test results were 
30 recorded as negative. 

Two cell lines, Vero E6 cells and NCI-H292 cells, inocu¬ 
lated with oropharyngeal specimens from Patient 16 (a 46 
year old male physician with an epidemiologic link to a 
hospital with multiple SARS patients) initially showed CPE 
(Table 1) 


TABLE 1 


Specimens from patients with SARS that were positive for SARS-CoV by one or more methods*. 


Patient 

No. 

Exposure 

and 

Setting 

Age/Sex 

Findings 
on Chest 
Radiograph 

Hospital- 

ization 

Serologie 

Results 

Specimen 

Isolation 

RT-PCR 

1 

Singapore, 

hospital 

53 yr/F 

Pneumonia 

Yes 

+ 

Nasal, 

oropharyngeal 

swabs 

- 

Not done 

2t 

Hong 

Kong, 

hotel 

36 yr/F 

Pneumonia 

Yes 

+ 

Nasal, swab 


Not done 

3 

Hong 

Kong, 

hotel 

22 yr/M 

Pneumonia 

Yes 

+ 

Swab 



4f 

Hong 

Kong, 

hotel 

39 yr/M 

Pneumonia 

Yes 

+ 

Nasal, 

pharyngeal 

swab 



5 

Hong 

Kong, 

hotel 

49 Yr/M 

Pneumonia 

Yes 

Not done 

Sputum 

+ 

+ 

6t 

Hong 

Kong, 

hotel 

46 yr/M 

Pneumonia 

Yes 

+ 

Kidney, lung, 
bronchoalveolar 
lavage 

+§ 

+ 

7 

Vietnam, 

hospital 

Adult/ 
un known 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

+ 

+ 

8 

Vietnam, 

hospital 

Adult/ 
un known 

Pneumonia 

Yes 

_ 

Oropharyngeal 

wash 

- 

+ 

9 

Vietnam, 

hospital 

Adult/ 

unknown 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

- 

+ 

10 

Vietnam, 

hospital 

Adult/ 

unknown 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

- 

+ 

11 

Vietnam, 

hospital 

Adult/ 

unknown 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

- 

+ 
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TABLE 1-continued 


Specimens from patients with SARS that were positive for SARS-CoV by one or more methods*. 


Exposure Findings 

Patient and on Chest Hospital- Serologie 

No. Setting Age/Sex Radiograph ization Results Specimen Isolation RT-PCR 


12 

Vietnam, 
ho spital 

Adult/ 

unknown 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

- 

+ 

13 

Vietnam, 

hospital 

Adult/ 

unknown 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

+ 

+ 

14 

Vietnam, 

hospital 

Adult/ 

unknown 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

- 

+ 

15 

Vietnam, 

hospital 

Adult/ 

unknown 

Pneumonia 

Yes 

- 

Oropharyngeal 

wash 

- 

+ 

16 

Vietnam, 

hospital 

46 yr/M 

Pneumonia 

Yes 

+ 

Nasal, 

oropharyngeal 

swabs 

+1 

+ 

17 

Canada, 

family 

43 yr/M 

Pneumonia 

Yes 

Not done 

Lung, bone 
marrow 

- 


18 

Taiwan, 

family 

51 yr/F 

Pneumonia 

Yes 

- 

Sputum 

- 

+ 

19 

Hong 

Kong, 

hotel 

Adult/F 

Pneumonia 

Yes 

+ 

Oropharyngeal 

wash 


+ 


*Plus signs denote positive results, and minus signs negative results. The Serologie and RT-PCR assays were not necessarily performed on 
samples obtained at the same time. 

tThis was a late specimen, antibody positive at first sample. 

JTravel included China, Hong Kong (hotel), and Hanoi (the patient was the index patient in the French Hospital). 

§ Isolation was from the kidney only. 

^Isolation was from the oropharyngeal only. 


The CPE in the Vero E6 cells was first noted on the fifth 30 
day post-inoculation; it was focal, with cell rounding and a 
refractive appearance in the affected cells that was soon 
followed by cell detachment (FIG. 1A). The CPE spread 
quickly to involve the entire cell monolayer witliin 24 to 48 
hours. Subculture of material after preparation of a master 35 
seed stock (used for subsequent antigen production) resulted 
in the rapid appearance of CPE, as noted above, and in 
complete destruction of the monolayer in the inoculated 
flasks within 48 hours. Similar CPE was also noted in four 
additional cultures: three cultures of respiratory specimens 
(two oropharyngeal washes and one sputum specimen) and 
one culture of a Suspension of kidney tissue obtained at 
autopsy. In these specimens, the initial CPE was observed 
between day two and day four and, as noted above, the CPE 
rapidly progressed to involve the entire cell monolayer. 

Tissue culture samples showing CPE were prepared for 45 
electron-microscopical examination. Negative-stain elec- 
tron-microscopical specimens were prepared by drying cul¬ 
ture supematant, mixed 1:1 with 2.5% paraformaldehyde, 
onto Formvarcarbon-coated grids and staining with 2% 
methylamine tungstate. Thin-section electron-microscopical 50 
specimens were prepared by fixing a washed cell pellet with 
2.5% glutaraldehyde and embedding the cell pellet in epoxy 
resin. In addition, a master seed stock was prepared from the 
remaining culture supematant and cells by freeze-thawing 
the culture flask, clarifying the thawed contents by centrifii- 
gation at lOOOxg, and dispensing the supematant into ali- 
quots stored in gas phase over liquid nitrogen. The master 
seed stock was subcultured into 850-cm 2 roller bottles of 
Vero E6 cells for the preparation of formalin-fixed positive 
control cells for innnunohistochemical analysis, mixed with 
normal Vero E6 cells, and gamma-irradiated for preparation 60 
of spot slides for IFA tests or extracted with detergent and 
gamma-irradiated for use as an ELISA antigen for antibody 
tests. 

Examination of CPE-positive Vero E6 cells by thin- 
section electron microscopy revealed characteristic coro- 65 
naviras particles within the cistemae of the rough endoplas- 
mic reticulum and in vesicles (FIG. 2A) (Becker et al., J. 


Virol. 1:1019—27, 1967; Oshiro et al. J. Gen. Virol 
12:161-8, 1971). Extracellular particles were found in large 
clusters and adhering to the surface of the plasma mem- 
brane. Negative-stain electron microscopy identified Coro¬ 
navirus particles, 80 to 140 nm in diameter, with 20- to 
40-mn complex surface projections surrounding the periph- 
ery (FIG. 2B). Hemagglutinin esterase-type glycoprotein 
projections were not seen. 

The isolation and growth of a human-derived Coronavirus 
in Vero E6 cells were unexpected. The previously known 
human coronaviruses are notably fastidious, preferring 
select cell lines, organ culture, or suckling mice for propa- 
gation. The only human or animal Coronavirus which has 
been shown to grow in Vero E6 cells is PEDV, and it requires 
the addition of trypsin to culture medium for growth in the 
cells. Moreover, PEDV adapted to growth in Vero E6 cells 
results in a strikingly difi'erent CPE, with cytoplasmic vacu- 
oles and the formation of large syncytia. Syncytial cells were 
only observed occasionally in monolayers of Vero E6 cells 
infected with the SARS-CoV; they clearly do not represent 
the dominant CPE. 

Reverse Transcription-Polymerase Chain Reaction and 
Sequencing 

For RT-PCR assays, cell-culture supernatants were placed 
in lysis buffer. RNA extracts were prepared from 100 p.1 of 
each specimen (or culture supematant) with the automated 
NucliSens extraction System (bioMerieux, Durham, N.C.). 
Initially, degenerate, inosine-containing primers IN-2 (+) 
5’GGGTTGGGACTA TCCTAAGTGTGA3' (SEQ ID NO: 
34) and IN-4 (-) 5'TAACACACAACICCATCA TCA3' 
(SEQ ID NO: 35) were designed to aimeal to sites encoding 
conserved amino acid motifs that were identified on the 
basis of alignments of available coronavims ORFla, 
ORFlb, S, F1E, M, and N gene sequences. Additional, 
SARS-specific, primers Cor-p-F2 (+) 5'CTAACATGCT- 
TAGGATAATGG3' (SEQ ID NO: 13), Cor-p-F3 (+) 
5'GCCTCTCTTGTTCTTGCTCGC3' (SEQ ID NO: 14), 
and Cor-p-Rl (-) 5’CAGGTAAGCGTAAAACTCATC3 
(SEQ ID NO: 15) were designed as sequences were gener- 



US 7,220,852 Bl 


31 

ated from RT-PCR products amplified with the degenerate 
primers. These SARS-specific primers were used to test 
patient specimens for SARS (see below). Primers used for 
specific amplification of human metapneumovirus have 
been described by Falsey et al. ( J. Infect. Dis. 87:785-90, 
2003). 

For RT-PCR products of less than 3 kb, cDNA was 
synthesized in a 20 pl reaction mixbire containing 500 ng of 
RNA, 200 U of Superscript™ II reverse transcriptase (Invit- 
rogen Life Technologies, Carlsbad, Calif.), 40 U of RNasin 
(Promega Corp., Madison, Wis.), 100 mM each dNTP 
(Roche Molecular Biochemicals, Indianapolis, Ind.), 4 pl of 
5x reaction buffer (Invitrogen Life Teclinologies, Carlsbad, 
Calif.), and 200 pmol of the reverse primer. Hie reaction 
mixture, except for the reverse transcriptase, was heated to 
70° C. for 2 minutes, cooled to 4° C. for 5 minutes and then 
heated to 42° C. in a thermocycler. The mixture was held at 
42° C. for 4 minutes, and then the reverse transcriptase was 
added, and the reactions were incubated at 42° C. for 45 
minutes. Two microliters of the cDNA reaction was used in 
a 50 pl PCR reaction containing 67 mM Tris-HCl (pH 8 . 8 ), 
1 mM each primer, 17 mM ammonium sulfate, 6 mM 
EDTA, 2 mM MgCl 2 , 200 mM each dNTP, and 2.5 U of Taq 
DNA polymerase (Roche Molecular Biochemicals, India¬ 
napolis, Ind.). The thermocycler program for the PCR con- 
sisted of 40 cycles of denaturation at 95° C. for 30 seconds, 
annealing at 42° C. for 30 seconds, and extension at 65° C. 
for 30 seconds. For SARS-CoV-specific primers, the anneal¬ 
ing temperature was increased to 55° C. 

For amplification of fragments longer than 3 kb, regions 
of the genome between sections of known sequence were 
amplified by rneans of a long RT-PCR protocol and SARS- 
CoV-specific primers. First-strand cDNA synthesis was per- 
formed at 42° C. or 50° C. using Superscript™ II RNase H 
reverse transcriptase (Invitrogen Life Technologies, Carls¬ 
bad, Calif.) according to the manufacturer’s instructions 
with minor modifications. Coronavirus-specific primers 
(500 ng) and SARS-CoV RNA (350 ng) were combined with 
the PCR Nucleotide Mix (Roche Molecular Biochemicals, 
Indianapolis, Ind.), heated for 1 minute at 94° C., and cooled 
to 4° C. in a thermocycler. The 5x first-strand buffer, 
dithiothreitol (Invitrogen Life Technologies, Carlsbad, 
Calif.), and Protector RNase Inhibitor (Roche Molecular 
Biochemicals, Indianapolis, Ind.) were added, and the 
samples were incubated at 42° C. or 50° C. for 2 minutes. 
After reverse transcriptase (200 U) was added, the samples 
were incubated at 42° C. or 50° C. for 1.5 to 2 hours. 
Samples were inactivated at 70° C. for 15 minutes and 
subsequently treated with 2 U of RNase H (Roche Molecular 
Biochemicals, Indianapolis, Ind.) at 37° C. for 30 minutes. 
Long RT-PCR amplification of 5- to 8 -kb fragments was 
performed using Taq Plus Precision (Stratagene, La Jolla, 
Calif.) and AmpliWax PCR Gern 100 beads (Applied Bio¬ 
systems; Foster City, Calif.) for “hot Start” PCR with the 
following thermocycling parameters: denaturation at 94° C. 
for 1 minute followed by 35 cycles of 94° C. for 30 seconds, 
55° C. for 30 seconds, an increase of 0.4 degrees per second 
up to 72° C., and 12° C. for 7 to 10 minutes, with a final 
extension at 12° C. for 10 minutes. RT-PCR products were 
separated by electrophoresis on 0.9% agarose TAE gels and 
purified by use of a QIAquick Gel Extraction Kit (Qiagen, 
Inc., Santa Clarita, Calif.). 

In all cases, the RT-PCR products were gel-isolated and 
purified for sequencing by means of a QIAquick Gel Extrac¬ 
tion kit (Qiagen, Inc., Santa Clarita, Calif.). Both Strands 
were sequenced by automated methods, using fluorescent 
dideoxy-chain terminators (Applied Biosystems; Foster 
City, Calif.). 

The sequence of the leader was obtained from the sub- 
genomic mRNA coding for the N gene and from the 5' 
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temiinus of genomic RNA. The 5' rapid amplification of 
cDNA ends (RACE) technique (Harcourt et al., Virology 
271:334^19, 2000) was used with reverse primers specific 
for the N gene or for the 5' untranslated region. RACE 
5 products were either sequenced directly or were cloned into 
a plasmid vector before sequencing. A primer that was 
specific for the leader of SARS-CoV was used to amplify the 
region between the 5'-tenninus of the genome and known 
sequences in the rep gene. The 3'-terminus of the genome 
was amplified for sequencing by use of an oligo-(dT) primer 
and primers specific for the N gene. 

Once the complete SARS-CoV genomic sequence had 
been detennined, it was confinned by sequencing a series of 
independently amplified RT-PCR products spanning the 
entire genome. Positive- and negative-sense sequencing 
15 primers, at intervals of approximately 300 nt, were used to 
generate a confirmatory sequence with an average redun- 
dancy of 9.1. The confirmatory sequence was identical to the 
original sequence. The genomic sequence (SEQ ID NO: 1) 
was published in the GenBank sequence database (Acces- 
20 sion No. AY278741) on Apr. 21, 2003. 

Sequence Analysis 

Predicted amino acid sequences were compared with 
those from reference viruses representing each species for 
25 which complete genomic sequence infonnation was avail- 
able: group 1 representatives included human Coronavirus 
229E (GenBank Accession No. AF304460), porcine epi- 
demic diarrhea virus (GenBank Accession No. AF353511), 
and transmissible gastroenteritis virus (GenBank Accession 
3Q No. AF271965); group 2 representatives included bovine 
J ' Coronavirus (GenBank Accession No. AF220295) and 
mouse hepatitis virus (GenBank Accession No. AF201929); 
group 3 was represented by infectious bronchitis virus 
(GenBank Accession No. M95169). Sequences for repre- 
sentative strains of other Coronavirus species for which 
35 partial sequence Information was available were included 
for some of the structural protein comparisons: group 1 
representative strains included canine Coronavirus (Gen¬ 
Bank Accession No. Dl3096), feline Coronavirus (GenBank 
Accession No. AY204704), and porcine respiratory coro- 
40 navirus (GenBank Accession No. Z24675); and group 2 
representatives included three strains of human Coronavirus 
OC43 (GenBank Accession Nos. M76373, L14643 and 
M93390), porcine hemagglutinating encephalomyelitis 
virus (GenBank Accession No. AY078417), and rat coro- 
45 navirus (GenBank Accession No. AF207551). 

Partial nucleotide sequences of the polymerase gene were 
aligned with published Coronavirus sequences, using 
CLUSTAL W for Unix (version 1.7; Thompson et al., 
Nucleic Acids Res. 22:4673-80, 1994). Phylogenetic trees 
50 were computed by maximum parsimony, distance, and 
maximum likelihood-based criteria analysis with PAUP 
(version 4.0.dl0; Swofford ed., Phylogenetic Analysis using 
Parsimony and other Methods, Sinauer Associates, Sunder¬ 
land, Mass.). When compared with other human and animal 
coronavimses, the nucleotide and deduced a min o acid 
55 sequence from this region had similarity scores ranging from 
0.56 to 0.63 and from 0.57 to 0.74, respectively. The highest 
sequence similarity was obtained with group II coronavi- 
rases. The maximum-parsimony tree obtained from the 
nucleotide-sequence alignment is shown in FIG. 3. Boot- 
60 strap analyses of the internal nodes at the internal branches 
of the tree provided strong evidence that the SARS-CoV is 
genetically distinct from other known coronavimses. 

Microarray analyses (using a long oligonucleotide DNA 
microarray with array elements derived from highly con- 
65 served regions within viral families) of samples from 
infected and uninfected cell cultures gave a positive signal 
for a group of eight oligonucleotides derived from two virus 
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families: Coronaviridae and Astroviridae (Wang et al., PNAS 
99:15687-92, 2002). All of the astroviruses and two of the 
Coronavirus oligonucleotides share a consensus sequence 
motif that maps to the extreme 3'-end of astroviruses and two 
members of the Coronavirus family: avian infectious bron- 
chitis and turkey Coronavirus (Jonassen et al., J. Gen. Virol. 
79:715-8,1998). Results were consistent with the identity of 
the isolate as a Coronavirus. 

Additional sequence alignments and neighbor-joining 
trees were generated by using ClustalX (Thompson et al., 
Nucleic Acids Res. 25:4876-82, 1997), Version 1.83, with 
the Gönnet protein comparison matrix. The resulting trees 
were adjusted for final output by using treetool Version 2.0.1. 
Uncorrected pairwise distances were calculated from the 
aligned sequences by using the Distances program from the 
Wisconsin Sequence Analysis Package, Version 10.2 (Accel- 
rys, Burlington, Mass.). Distances were converted to percent 
identity by subtracting from 100. The amino acid sequences 
for three well-defined enzymatic proteins encoded by the rep 
gene and the four major structural proteins of SARS-CoV 
were compared with those from representative viruses for 
each of the species of Coronavirus for which complete 
genomic sequence information was available (FIG. 4, Table 
2). The topologies of the resulting phylograms are remark- 
ably similar (FIG. 4). For each protein analyzed, the species 
formed monophyletic clusters consistent with the estab- 
lishedtaxonomic groups. In all cases, SARS-CoV sequences 
segregated into a fourth, well-resolved branch. These clus¬ 
ters were supported by bootstrap values above 90% (1000 
replicates). Consistent with pairwise comparisons between 
the previously characterized Coronavirus species (Table 2), 
there was greater sequence Conservation in the enzymatic 
proteins (3CI/ re> , polymerase (POL), and helicase (HEL)) 
than among the structural proteins (S, E, M, and N). These 
results indicate that SARS-CoV is not closely related to any 
of the previously characterized coronaviruses and forms a 
distinct group within the genus Coronavirus. 
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87:785-90, 2003. Positive and negative RT-PCR Controls, 
containing standardized viral RNA extracts, and nuclease- 
free water were included in each run. Amplified 6 -FAM- 
labeled products were analyzed by capillary electrophoresis 
5 on an ABI 3100 Prism Genetic Analyzer with GeneScan 
Software (version 3.1.2; Applied Biosystems; Foster City, 
Calif.). Specimens were considered positive for SARS-CoV 
if the amplification products were within one nucleotide of 
the expected product size (368 nucleotides for Cor-p-F2 or 
Cor-p-Rl and348nucleotides forCor-p-F3 orCor-p-Rl) for 
both specific primer sets, as confirmed by a second PCR 
reaction from another aliquot of RNA extract in a separate 
laboratory. Where DNA yield was sufficient, the amplified 
products were also sequenced. Additionally, as described 
above, microarray-based detection of SARS-CoV in patient 
15 specimens was carried out (Wang et al., PNAS 99:15687-92, 
2002 and Bohlander et al., Genomics 13:1322-24, 1992). 

Example 3 

2 q Immunohistochemical and Histopathological 

Analysis, and Electron-Microscopical Analysis of 
Bronchoalveolar Lavage Fluid 

This example illustrates innnunohistochemical, histo- 
25 pathological and electron-microscopical analysis of Vero E 6 
cells infected with the SARS-CoV and tissue samples from 
SARS patients. 

Fonnalin-fixed, parafün-embedded Vero E 6 cells infected 
with the SARS-CoV and tissues obtained from patients with 
SARS were stained with hematoxylin and eosin and various 
1 immunohistochemical stains. Innnunohistochemical assays 
were based on a method described previously for hantavirus 
(Zaki et al., Amer. J. Pathol. 146:552-79, 1995). Briefly, 
4-pm sections were deparaffinized, rehydrated, and digested 
in Proteinase K for 15 minutes. Südes were then incubated 
for 60 minutes at room temperature with monoclonal anti- 


TABLE 2 




Pairwise amino acid identities of Coronavirus proteins. 




Group 

Virus 

3CLPRO 

POL 

HEL 

S 

E 

M 

N 





Pairwise Amino Acid Identity (Percent) 



Gl 

HCoV-229E 

40.1 

58.8 

59.7 

23.9 

22.7 

28.8 

23.0 


PEDV 

44.4 

59.5 

61.7 

21.7 

17.6 

31.8 

22.6 


TGEV 

44.0 

59.4 

61.2 

20.6 

22.4 

30.0 

25.6 

G2 

BCoV 

48.8 

66.3 

68.3 

27.1 

20.0 

39.7 

31.9 


MHV 

49.2 

66.5 

67.3 

26.5 

21.1 

39.0 

33.0 

G3 

IBV 

41.3 

62.5 

58.6 

21.8 

18.4 

27.2 

24.0 





Predicted Protein Length (aa) 




SARS-CoV 

306 

932 

601 

1255 

76 

221 

422 


CoV Range 

302-307 

923-940 

506-600 

1173-1452 76-108 

225-262 

377-154 


Example 2 

Detection of SARS-CoV in a Subject 

Ulis example demonstrates the detection of SARS-CoV 
in patient specimens using SARS-CoV-specific primers. 

The SARS-specific primers Cor-p-F2 (SEQ ID NO: 13), 
Cor-p-F3 (SEQ ID NO: 14) and Cor-p-Rl (SEQ ID NO: 15) 
were used to test patient specimens for SARS. One primer 
for each set was 5'-end-labeled with 6 -FAM to facilitate 
GeneScan analysis. One-step amplification reactions were 
performed with the Access RT-PCR System (Promega, 
Madison, Wis.) as described by Falsey et al., J. Infect. Dis. 


bodies, polyclonal antiserum or ascitic fluids derived from 
animal species with reactivities to various known coronavi¬ 
ruses, and with a convalescent-phase serum specimen from 
a patient with SARS. 

Optimal dilutions of the primary antibodies were deter- 
mined by titration experiments with coronavirus-infected 
cells from patients with SARS and with noninfected cells or, 
when available, with concentrations recommended by the 
manufacturers. After sequential application of the appropri- 
ate biotinylated link antibody, avidin-alkaline phosphatase 
complex, and naphthol-fast red Substrate, sections were 
counterstained in Mayer’s hematoxylin and mounted with 
aqueous mounting medium. The following antibody and 
tissue Controls were used: serum specimens from nonin- 


55 


60 


65 
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fected animals, various coronavirus-infected cell cultures 
and animal tissues, noninfected cell cultures, and normal 
human and animal tissues. Tissues from patients were also 
tested by immunohistochemical assays for various other 
viral and bacterial pulmonary pathogens. In addition, a BAL 
specimen was available from one patient for thin-section 
electron-microscopical evaluation. 

Lung tissues were obtained from tlie autopsy of three 
patients and by open lung biopsy of one patient, 14-19 days 
following onset of SARS Symptoms. Confirmatory labora- 
tory evidence of infection with Coronavirus was available for 
two patients (patients 6 and 17) and included PCR ampli- 
fication of Coronavirus nucleic acids from tissues, viral 
isolation from BAL fluid or detection of serum antibodies 
reactive with Coronavirus (Table 1). For two patients, no 
samples were available for molecular, cell culture, or sero- 
logical analysis; however, botli patients met the CDC defi- 
nition for probable SARS cases and had strong epidemio- 
logic links with laboratory-confirmed SARS cases. 
Histopathologie evaluation of lung tissues of the four 
patients showed diffuse alveolar damage at various levels of 
Progression and severity. Changes included hyaline mem- 
brane formation, interstitial mononuclear inflanunatory 
infiltrates, and desquamation of pneumocytes in alveolar 
spaces (FIG. 5A). Other findings identified in some patients 
included focal intraalveolar hemorrhage, necrotic inflanuna¬ 
tory debris in small airways, and organizing pneumonia. 
Multinucleated syncytial cells were identified in the intraal¬ 
veolar spaces of two patients who died 14 and 17 days, 
respectively, after onset of illness. These cells contained 
abundant vacuolated cytoplasm with cleaved and convo- 
luted nuclei. No obvious intranuclear or intracytoplasmic 
viral inclusions were identified (FIG. 5B), and electron- 
microscopical examination of a limited number of these 
syncytial cells revealed no Coronavirus particles. No defini¬ 
tive immunostaining was identified in tissues from SARS 
patients with the use of a battery of immunohistochemical 
stains reactive with coronaviruses from antigenic groups I, 
II, and III. In addition, no staining of patient tissues was 
identified with the use of immunohistochemical stains for 
influenzaviruses A and B, adenoviruses, Hendra and Nipah 
viruses, human metapneumovirus, respiratory syncytial 
virus, measles virus, Mycoplasma pneumoniae , and Chlamy¬ 
dia pneumoniae. 

Evaluation of Vero E6 cells infected with Coronavirus 
isolated from a patient with SARS revealed viral CPE that 
included occasional multinucleated syncytial cells but no 
obvious viral inclusions (FIG. 5C). Immunohistochemical 
assays with various antibodies reactive with coronaviruses 
from antigenic group I, including HCoV-229E, FIPV and 
TGEV, and with an immune serum specimen from a patient 
with SARS, demonstrated strong cytoplasmic and membra- 
nous staining of infected cells (FIG. 5C and Table 3); 
however, cross-reactivity with the same immune human 
serum sample and FIPV antigen was not observed. No 
staining was identified with any of several monoclonal or 
polyclonal antibodies reactive with coronaviruses in anti¬ 
genic group II (HCoV-OC43, BCoV and MHV) or group III 
(TCoV and IBV-Avian). Electron microscopical examina¬ 
tion of a BAL fluid from one patient revealed many coro¬ 
navirus-infected cells (FIGS. 6A-B). 


TABLE 3 


Immunohistochemical reactivities of various polyclonal group I anti- 
coronavirus reference antiserum samples with a Coronavirus isolated from 
a patient with SARS and with selected antieenic eroup I coronaviruses. 


Immunohistochemical reactivity of antiserum with 
coronavirus-infected culture cells 

Antiserum 

SARS-CoV 
(Vero E6) 

HCoV-229E 
(mouse 3T3- 
hAPN) 

FIPV-1 

(BHK-fAPN) 

Convalescent- 
phase SARS 
(patient 3) 

+ 

+ 

- 

Guinea pig anti- 
HCoV-229E 

+ 

+ 

- 

Rabbit anti- 
HCoV-229E 

+ 

+ 

+ 

Feline anti- 
FIPV-1 

+ 

+ 

+ 

Porcine anti- 
TGEV 

+ 


+ 


25 


Example 4 

SARS-CoV Serologie Analysis 


This example illustrates representative methods of per- 
forming serological analysis of SARS-CoV. 

Spot slides were prepared by applying 15 pl of the 
Suspension of gamma-irradiated mixed infected and nonin¬ 
fected cells onto 12-well Teflon-coated slides. Slides were 
allowed to air dry before being fixed in acetone. Slides were 
then stored at -70° C. until used for IFA tests (Wulff and 
35 Lange, Bull. WHO 52:429-36, 1975). An ELISA antigen 
was prepared by detergent extraction and subsequent gamma 
irradiation of infected Vero E6 cells (Ksiazek et al., J. Infect. 
Dis. 179 (suppl. 1 ):S191—8, 1999). The optimal dilution 
(1:1000) for the use of this antigen was determined by 
40 checkerboard titration against SARS patient serum from the 
convalescent phase; a control antigen, similarly prepared 
from uninfected Vero E6 cells, was used to control for 
specific reactivity of tested sera. The conjugates used were 
goat antihuman IgG, IgA, and IgM conjugated to fluorescein 
45 isothiocyanate and horseradish peroxidase (Kirkegaard and 
Perry, Gaithersburg, Md.), for the IFA test and ELISA, 
respectively. Specificity and cross-reactivity of a variety of 
serum samples to the newly identified virus were evaluated 
by using the tests described herein. For this evaluation, 
50 serum from SARS patients in Singapore, Bangkok and Hong 
Kong was used, along with serum from healthy blood donors 
from the CDC serum bank and from persons infected with 
known human Coronavirus (human coronaviruses OC43 and 
229E) (samples provided by E. Walsh and A. Falsey, Uni- 
versity of Rochester School of Medicine and Dentistry, 
55 Rochester, N.Y.). 

Spot slides with infected cells reacted with serum from 
patients with probable SARS in the convalescent phase 
(FIG. 1B). Screening of a panel of serum from patients with 
suspected SARS from Hong Kong, Bangkok, Singapore as 
611 well as the United States showed a high level of specific 
reaction with infected cells, and conversion from negative to 
positive reactivity or diagnostic rises in the IFA test by a 
factor of four. Similarly, tests of these same serum samples 
with the ELISA antigen showed high specific signal in the 
65 convalescent-phase samples and conversion from negative 
to positive antibody reactivity or diagnostic increases in titer 
(Table 4). 
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TABLE 4 


Results of serological testing with both IFA assay and ELISA in SARS 
patients tested against the newly isolated human Coronavirus. 

Source 

Serum No. 

Days After Onset 

ELISA Titer* 

IFA Titer* 

Hong Kong 

1.1 

4 

<100 

<25 

Hong Kong 

1.2 

13 

^6400 

1600 

Hong Kong 

2.1 

11 

400 

100 

Hong Kong 

2.2 

16 

1600 

200 

Hong Kong 

3.1 

7 

<100 

<25 

Hong Kong 

3.2 

17 

^6400 

800 

Hong Kong 

4.1 

8 

<100 

<25 

Hong Kong 

4.2 

13 

1600 

50 

Hong Kong 

5.1 

10 

100 

<25 

Hong Kong 

5.2 

17 

^6400 

1600 

Hong Kong 

6.1 

12 

1600 

200 

Hong Kong 

6.2 

20 

^6400 

6400 

Hong Kong 

7.1 

17 

400 

50 

Hong Kong 

7.2 

24 

^6400 

3200 

Hong Kong 

8.1 

3 

<100 

<25 

Hong Kong 

8.2 

15 

^6400 

200 

Hong Kong 
(Hanoi) 

9.1 

5 

<100 

<25 

Hong Kong 

9.2 

11 

^6400 

1600 

Bangkok 

1.1 

2 

<100 

<25 

Bangkok 

1.2 

4 

<100 

<25 

Bangkok 

1.3 

7 

<100 

<25 

Bangkok 

1.4 

15 

1600 

200 

United States 

1.1 

2 

<100 

<25 

United States 

1.2 

6 

400 

50 

United States 

1.3 

13 

^6400 

800 

Singapore 

1.1 

2 

100 

<25 

Singapore 

1.2 

11 

^6400 

800 

Singapore 

2.1 

6 

100 

<25 

Singapore 

2.2 

25 

i?6400 

400 

Singapore 

3.1 

6 

100 

<25 

Singapore 

3.2 

14 

1=6400 

400 

Singapore 

4.1 

5 

100 

<25 

Singapore 

4.2 

16 

1600 

400 


*Reciprocal of dilution 


Information from the limited numbers of samples tested 
suggests that antibody is first detectable by IFA assay and 
ELISA between one and two weeks afier the onset of 
Symptoms in the patient. IFA testing and ELISA of a panel 40 
of 384 randomly selected serum samples (from U.S. blood 
donors) were negative for antibodies to the new Coronavirus, 
with the exception of one specimen that had minimal 
reactivity on ELISA. A panel of paired human serum 
samples with diagnostic increases (by a factor of four or 45 
more) in antibody (with very high titers to the homologous 
viral antigen in the convalescent-phase serum) to the two 
known human coronaviruses, OC43 (13 pairs) and 229E (14 
pairs), showed no reactivity in either acute- or convalescent- 
phase serum with the newly isolated Coronavirus by either 
the IFA test or the ELISA. 


Example 5 

Poly(A) + RNA Isolation and Northern 
Hybridization 

This example illustrates a representative method of North¬ 
ern hybrididization to detect SARS-CoV messages in Vero 
E 6 cells. 

Total RNA from infected or uninfected Vero E 6 cells was 60 
isolated with Trizol reagent (Invitrogen Life Technologies, 
Carlsbad, Calif.) used according to the manufacturer’s rec- 
ommendations. Poly(A) + RNA was isolated from total RNA 
by use of the Oligotex Direct mRNAKit (Qiagen, Inc., Santa 
Clarita, Calif.), following the instractions for the batch 65 
protocol, followed by ethanol precipitation. RNA isolated 
from 1 cm 2 of cells was separated by electrophoresis on a 
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0.9% agarose gel containing 3.7% fonnaldehyde, followed 
by partial alkaline hydrolysis (Ausubel et al. eds. Current 
Protocols in Moleeular Biology, vol. 1, John Wiley & Sons, 
Inc., NY, N.Y., Ch. 4.9, 1996). RNA was transferred to a 
nylon membrane (Roche Moleeular Biochemicals, India¬ 
napolis, Ind.) by vaeuum blotting (Bio-Rad, Hercules, 
Calif.) and fixed by UV cross-linking. The DNAtemplate for 
probe synthesis was generated by RT-PCR amplification of 
SARS-CoV nt 29,083 to 29,608 (SEQ ID NO: 1), by using 
a reverse primer containing a T7 RNA polymerase promoter 
to facilitate generation of a negative-sense riboprobe. In 
vitro transcription of the digoxigenin-labeled riboprobe, 
hybridization, and detection of the bands were carried out 
with the digoxigenin System by using manufacturer’s rec- 
onnnended procedures (Roche Moleeular Biochemicals, 
Indianapolis, Ind.). Signals were visualized by chemilumi- 
nescence and detected with x-ray film. 

Example 6 

SARS-CoV Genome Organization 

This example illustrates the genomic Organization of the 
SARS-CoV genome, including the location of SARS-CoV 
ORFs. 

The genome of SARS-CoV is a 29,727-nucleotide, poly- 
adenylated RNA, and 41% of the residues are G or C (ränge 
forpublished Coronavirus complete genome sequences, 37% 
to 42%). The genomic Organization is typical of coronavi¬ 
ruses, having the characteristic gene Order [5'-replicase 
(rep), spike (S), envelope (E), membrane (M), nucleocapsid 
(N)-3'] and short untranslated regions at both termini (FIG. 
7A, Table 5). The SARS-CoV rep gene, which comprises 
approximately two-tliirds of the genome, encodes two 
polyproteins (encoded by ORFla and ORFlb) that undergo 
co-translational proteolytic processing. There are four ORFs 
downstream of rep that encode the structural proteins, S, E, 

M, and N, which are common to all known coronaviruses. 
The hemagglutinin-esterase gene, which is present between 
ORFlb and S in group 2 and some group 3 coronaviruses 
(Lai and Hohnes, in Fields Virology, eds. Knipe and How- 
ley, Lippincott Williams and Wilkins, New York, 4 th edition, 
2001, Ch. 35), was not found in SARS-CoV. 

Coronaviruses also encode a number of non-structural 
proteins that are located between S and E, between M and 

N, or downstream of N. These non-structural proteins, which 
vary widely among the different Coronavirus species, are of 
unknown fimetion and are dispensable for virus replication 
(Lai and Holmes, in Fields Virology, eds. Knipe and How- 
ley, Lippincott Williams and Wilkins, New York, 4 rh edition, 
2001, Ch. 35). The genome of SARS-CoV contains ORFs 
for five non-structural proteins of greater than 50 amino 
acids (FIG. 7B, Table 5). Two overlapping ORFs encoding 
proteins of 274 and 154 amino acids (termed XI (SEQ ID 
NO: 5) and X2 (SEQ ID NO: 6 ), respectively) are located 
between S (SEQ ID NO: 4) and E (SEQ ID NO: 7). Three 
additional non-structural genes, X3 (SEQ ID NO: 9), X4 
(SEQ ID NO: 10), and X5 (SEQ ID NO: 11) (encoding 
proteins of 63, 122, and 84 amino acids, respectively), are 
located between M (SEQ ID NO: 8 ) and N (SEQ ID NO: 
12). In addition to the five ORFs encoding the non-structural 
proteins described above, there are also two smaller ORFs 
between M and N, encoding proteins of less than 50 amino 
acids. Searches of the GenBank database (BLAST and 
FastA) indicated that there is no significant sequence simi- 
larity between these non-structural proteins of SARS-CoV 
and any other proteins. 
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The Coronavirus rep gene products are translated from 
genomic RNA, but the remaining viral proteins are trans¬ 
lated from subgenomic mRNAs that fonn a 3'-cotenninal 
nested set, each with a 5'-end derived from the genomic 
5'-leader sequence. The Coronavirus subgenomic mRNAs 
are synthesized through a discontinuous transcription pro- 
cess, the mechanism of which has not been unequivocally 
established (Lai and Holmes, in Fields Virology, eds. Knipe 
and Howley, Lippincott Williams and Wilkins, New York, 
4 edition, 2001, Ch. 35; Sawicki and Sawicki, Adv. Exp. 
Med. Biol. 440:215-19, 1998). The SARS-CoV leader 
sequence was mapped by comparing the sequence of 
5'-RACE products synthesized from the N gene mRNA with 
those synthesized from genomic RNA. A sequence, AAAC- 
GAAC (nucleotides 65-72 of SEQ ID NO: 1), was identified 
immediately upstream of the site where the N gene mRNA 
and genomic sequences diverged. This sequence was also 
present upstream of ORFla and immediately upstream of 
five other ORFs (Table 5), suggesting that it functions as the 
conserved core of the transcriptional regulatory sequence 
(TRS). 

In addition to the site at the 5'-terminus of the genome, the 
TRS conserved core sequence appears six times in the 
remainder of the genome. Ute positions of the TRS in the 
genome of SARS-CoV predict that subgenomic mRNAs of 
8.3, 4.5, 3.4, 2.5, 2.0, and 1.7 kb, not including the poly(A) 
tail, should be produced (FIGS. 7A-B, Table 5). At least five 
subgenomic mRNAs were detected by Northern hybridiza- 
tion of RNA from SARS-CoV-infected cells, using a probe 
derived from the 3'-untranslated region (FIG. 7C). The 
calculated sizes of the five predominant bands correspond to 
the sizes of five of the predicted subgenomic mRNAs of 
SARS-CoV; the possibility that other, low-abmidance 
mRNAs are present cannot be excluded. By analogy with 
other coronaviruses (Lai and Holmes, in Fields Virology. 
eds. Knipe and Howley, Lippincott Williams and Wilkins, 
New York, A' h edition, 2001, Ch. 35), the 8.3-kb and 1.7-kb 
subgenomic mRNAs are monocistronic, directing transla- 
tion of S and N, respectively, whereas multiple proteins are 
translated from the 4.5-kb (XI, X2, and E), 3.4-kb (M and 
X3), and 2.5-kb (X4 and X5) mRNAs. A consensus TRS is 
not found directly upstream of the ORF encoding the 
predicted E protein, and a monocistronic mRNA that would 
be predicted to code for E could not be clearly identified by 
Northern blot analysis. It is possible that the 3.6-kb band 
contains more than one mRNA species or that the monocis¬ 
tronic mRNA for E is a low-abundance message. 

TABLE 5 


Locations of SARS-CoV ORFs and sizes of proteins and mRNAs 
_Genome Location_ _Predicted Size 


ORF 

TRS a 

ORF Start 

ORF End 

Protein (aa) 

mRNA (nt)' 

la 

72 

265 

13,398 

4,378 

29,727 

lb 


13,398 

21,482 

2,695 


S 

21,491 

21,492 

25,256 

1,255 

8,308 c 

XI 

25,265 

25,268 

26,089 

274 

4,534 c 

X2 


25,689 

26,150 

154 


E 


26,117 

26,344 

76 


M 

26,353 

26,398 

27,060 

221 

3,446 c 

X3 


27,074 

27,262 

63 


X4 

27,272 

27,273 

27,638 

122 

2,527 c 


TABLE 5-continued 



Locations of SARS-CoV ORFs and sizes 

of proteins and mRNAs 


Genome Location 

Predicted Size 

ORF 

TRS a 

ORF Start ORF End 

Protein (aa) mRNA (nt) b 

X5 

N 

27,778 

28,111 

27,864 28,115 

28,120 29,385 

84 2,02 l d 

422 1,688° 


a The location is the 3'-most nucleotide in the consensus TRS. AAAC- 
GAAC. 

''Not including poly(A). Predicted size is based on the Position of the con¬ 
served TRS. 

c Corresponding mRNA detected by Northern blot analysis (FIG. 7C) 

“No mRNA corresponding to utilization of this consensus TRS was 
' 3 detected by Northern blot analysis (FIG. IC) 

Example 7 

20 Real-Time RT-PCR Assay for SARS-CoV 

Detection 

This example demonstrates the use of SARS-CoV-spe- 
cific primers and probes in a real-time RT-PCR assay to 
25 detect SARS-CoV in patient specimens. 

A variant of the real-time format, based on TaqMan probe 
hydrolysis technology (Applied Biosystems, Foster City, 
Calif.), was used to analyze a total of 340 clinical specimens 
collected from 246 persons with confirmed or suspected 
30 SARS-CoV infection. Specimens included oro- and 
nasopharyngeal swabs (dry and in viral transport media), 
sputa, nasal aspirates and washes, BAL, and lung tissue 
specimens collected at autopsy. 

Nucleic Acid Extraction 

35 SARS-CoV nucleic acids were recovered from clinical 
specimens using the automated NucliSens extraction System 
(bioMerieux, Durharn, N.C.). Following manufacturer’s 
instructions, specimens received in NucliSens lysis buffer 
were incubated at 37° C. for 30 min with intermittent 
411 rnixing, and 50 pL of silica Suspension, provided in the 
extraction kit, was added and mixed. The Contents of the 
tube were then transferred to a nucleic acid extraction 
cartridge and processed on an extractor Workstation. 
Approximately 40-50 uL of total nucleic acid eluate was 
4:1 recovered into nuclease-free vials and either tested imme¬ 
diately or stored at -70° C. 

Primers and Probes 

Multiple primer and probe sets were designed from the 
50 SARS-CoVpolymerase lb (nucleic acid 13,398 to 21,482 of 
SEQ ID NO: 1) and nucleocapsid gene (nucleic acid 28,120 
to 29,385 of SEQ ID NO: 1) sequences by using Primer 
Express Software Version 1.5 or 2.0.0 (Applied Biosystems, 
Foster City, Calif.) with the following default settings: 
55 primer melting temperature ( T M ) set at 60° C.; probe T M set 
at 10° C. greater than the primers at approximately 70° C.; 
and no guanidine residues permitted at the 5' probe termini. 
All primers and probes were synthesized by Standard phos- 
phoramidite chemistry techniques. TaqMan probes were 
60 labeled at the 5'-end with the reporter 6-FAM and at the 
3'-end with the quencher Blackhole Quencher 1 (Biosearch 
Technologies, Inc., Novato, Calif.). Optimal primer and 
probe concentrations were determined by cross-titration of 
serial twofold dilutions of each primer against a constant 
65 amount of purified SARS-CoV RNA. Primer and probe 
concentrations that gave the highest amplification efficien- 
cies were selected for fiirther study (Table 6). 
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TABLE 6 


Primers and probes used for real-time RT-PCR assays 3 


Assay ID Primer/probe 


Primaiy diagnostic assay 

SARS1 F 
R 
P 

SARS2 F 

R 
P 

SARS3 F 

R 
P 

To confirm positive results 

N3 F 

R 
P 

3-NTR F 

R 
P 

M F 

R 
P 


Sequence 


CATGTGTGGCGGCTCACTATAT (SEQ ID NO: 16) 
GACACTATTAGCATAAGCAGTTGTAGCA (SEQ ID NO: 17) 
TTAAACCAGGTGGAACATCATCCGGTG (SEQ ID NO: 18) 
GGAGCCTTGAÄTACACCCAAAG (SEQ ID NO: 19) 
GCACGGTGGCAGCATTG (SEQ ID NO: 20) 
CCACATTGGCACCCGCAATCC (SEQ ID NO: 21) 
CAAACATTGGCCGCAAATT (SEQ ID NO: 22) 

C'AATGC'GTGACATTC'CAAAGA (SEQ ID NO: 23) 
CACAATTTGCTCCAAGTGCCTCTGCA (SEQ ID NO: 24) 


GAAGTACCATCTGGGGCTGAG (SEQ ID NO: 25) 
CCGAAGAGCTACCCGACG (SEQ ID NO: 26) 
CTCTTTCATTTTGCCGTCACCACCAC (SEQ ID NO: 27) 
AGCTCTCCCTAGCATTATTCACTG (SEQ ID NO: 28) 
CACCACATTTTCATCGAGGC (SEQ ID NO: 29) 
TACCCTCGATCGTACTCCGCGT (SEQ ID NO: 30) 
TGTAGGCACTGATTCAGGTTTTG (SEQ ID NO: 31) 
CGGCGTGGTCTGTATTTAATTTA (SEQ ID NO: 32) 
CTGCATACAACCGCTACCGTATTGGAA (SEQ ID NO: 33) 


Genomic 

Region 

RNA Pol 

Nucleocapsid 

Nucleocapsid 

Nucleocapsid 

3-NTR 

M protein 


a RT-PCR, reverse transcription-polymerase chain reaction; F, forward primer; R, reverse primer; P, probe; NTR, 
nontranslated region. 


Real-Time RT-PCR Assay 

The real-time RT-PCR assay was performed by using the 
Real-Time One-Step RT-PCR Master Mix (Applied Biosys¬ 
tems, Foster City, Calif.). Each 25-pL reaction mixture 
contained 12.5 pL of 2x Master Mix, 0.625 pL of the 40x 
MultiScribe and RNase Inlribitor mix, 0.25 pL of 10 pM 
probe, 0.25 pL each of 50 pM forward and reverse primers, 
6.125 pL of nuclease-free water, and 5 pL of nucleic acid 
extract. Amplification was carried out in 96-well plates on an 
iCycler iQ Real-Time Detection System (Bio-Rad, Hercules, 
Calif.). Thermocycling conditions consisted of 30 minutes at 
48° C. for reverse transcription, 10 minutes at 95° C. for 
activation of the AmpliTaq Gold DNA polymerase, and 45 
cycles of 15 seconds at 95° C. and 1 minute at 60° C. Each 
run included one SARS-CoV genomic template control and 
at least two no-template Controls for the extraction (to check 
for contamination during sample processing) and one no- 
template control for the PCR-amplification Step. As a control 
for PCR inhibitors, and to monitor nucleic acid extraction 
efliciency, each sample was tested by real-time RT-PCR for 
the presence of the human ribonuclease (RNase) P gene 
(GenBank Accession No. NM_006413) by using the fol- 


lowing primers and probe: forward primer 5'-AGATTTG- 
30 GACCTGCGAGCG-3' (SEQ ID NO: 36); reverse primer 
5'-GAGCGGCTGTCTCCACAAGT-3' (SEQ ID NO: 37); 
probe 5'-TTCTGACC TGAAGGCTCTGCGCG-3' (SEQ ID 
NO: 38). Ute assay reaction was performed identically to 
that described above except that primer concentrations used 
35 were 30 pM each. Fluorescence measurements were taken 
and the threshold cycle (C r ) value for each sample was 
calculated by determining the point at which fluorescence 
exceeded a threshold limit set at the mean plus 10 Standard 
deviations above the baseline. A test result was considered 
40 positive if two or more of the SARS genomic targets showed 
positive results (C r ä=45 cycles) and all positive and negative 
control reactions gave expected values. 

While this disclosure has been described with an empha- 
sis upon preferred embodiments, it will be obvious to those 
of ordinary skill in the art that variations and equivalents of 
the preferred embodiments may be used and it is intended 
that the disclosure may be practiced otherwise than as 
specifically described herein. Accordingly, this disclosure 
includes all modifications encompassed within the spirit and 
scope of the disclosure as defined by the Claims below. 


SEQUENCE LISTING 


<160> NUMBER OF SEQ ID NOS: 38 

<210> SEQ ID NO 1 

<211> LENGTH: 29727 

<212> TYPE: DNA 

<213> ORGÄNISM: Coronavirus 

<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (265). . (13398 ) 
<223> OTHER INFORMATION: ORF la 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (13398 ).. (21482 ) 
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-continued 


<223> OTHER INFORMATION: ORF lb 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (21492)..(25256) 
<223> OTHER INFORMATION: ORF S 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (25268)..(26089 ) 
<223> OTHER INFORMATION: ORF XI 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (25689)..(26150 ) 
<223> OTHER INFORMATION: ORF X2 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (26117)..(26344) 
<223> OTHER INFORMATION: ORF E 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (26398)..(27060) 
<223> OTHER INFORMATION: ORF M 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (27074)..(27262 ) 
<223> OTHER INFORMATION: ORF X3 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (27273)..(27638 ) 
<223> OTHER INFORMATION: ORF X4 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (27864)..(28115) 
<223> OTHER INFORMATION: ORF X5 
<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (28120)..(29385 ) 
<223> OTHER INFORMATION: ORF N 


<400> SEQUENCE: 1 


ttattaggtt 

tttacctacc 

caggaaaagc 

caaccaacct 

cgatctcttg 

tagatctgtt 

60 

ctctaaacga 

actttaaaat 

ctgtgtagct 

gtcgctcggc 

tgcatgccta 

gtgcacctac 

120 

gcagtataaa 

caataataaa 

ttttactgtc 

gttgacaaga 

aacgagtaac 

tcgtccctct 

180 

tctgcagact 

gcttacggtt 

tcgtccgtgt 

tgcagtcgat 

catcagcata 

cctaggtttc 

240 

gtccgggtgt 

gaccgaaagg 

taagatggag 

agccttgttc 

ttggtgtcaa 

cgagaaaaca 

300 

cacgtccaac 

tcagtttgcc 

tgtccttcag 

gttagagacg 

tgctagtgcg 

tggcttcggg 

360 

gactctgtgg 

aagaggccct 

atcggaggca 

cgtgaacacc 

tcaaaaatgg 

cacttgtggt 

420 

ctagtagagc 

tggaaaaagg 

cgtactgccc 

cagcttgaac 

agccctatgt 

gttcattaaa 

480 

cgttctgatg 

ccttaagcac 

caatcacggc 

cacaaggtcg 

ttgagctggt 

tgcagaaatg 

540 

gacggcattc 

agtacggtcg 

tagcggtata 

acactgggag 

tactcgtgcc 

acatgtgggc 

600 

gaaaccccaa 

ttgcataccg 

caatgttctt 

cttcgtaaga 

acggtaataa 

gggageeggt 

660 

ggtcatagct 

atggcatcga 

tctaaagtct 

tatgacttag 

gtgacgagct 

tggcactgat 

720 

cccattgaag 

attatgaaca 

aaactggaac 

actaagcatg 

gcagtggtgc 

actccgtgaa 

780 

ctcactcgtg 

agctcaatgg 

aggtgcagtc 

actcgctatg 

tcgacaacaa 

tttctgtggc 

840 

ccagatgggt 

accctcttga 

ttgcatcaaa 

gattttctcg 

cacgcgcggg 

caagtcaatg 

900 

tgcactcttt 

ccgaacaact 

tgattacatc 

gagtcgaaga 

gaggtgtcta 

ctgctgccgt 

960 

gaccatgagc 

atgaaattgc 

ctggttcact 

gagcgctctg 

ataagagcta 

cgagcaccag 

1020 

acacccttcg 

aaattaagag 

tgccaagaaa 

tttgacactt 

tcaaagggga 

atgcccaaag 

1080 

tttgtgtttc 

ctcttaactc 

aaaagtcaaa 

gtcattcaac 

cacgtgttga 

aaagaaaaag 

1140 

actgagggtt 

tcatggggcg 

tatacgctct 

gtgtaccctg 

ttgcatctcc 

acaggagtgt 

1200 
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aacaatatgc 

acttgtctac 

cttgatgaaa 

tgtaatcatt 

gcgatgaagt 

ttcatggcag 

1260 

acgtgcgact 

ttctgaaagc 

cacttgtgaa 

cattgtggca 

ctgaaaattt 

agttattgaa 

1320 

ggacctacta 

catgtgggta 

cctacctact 

aatgctgtag 

tgaaaatgcc 

atgtcctgcc 

1380 

tgtcaagacc 

cagagattgg 

acctgagcat 

agtgttgcag 

attatcacaa 

ccactcaaac 

1440 

attgaaactc 

gactccgcaa 

gggaggtagg 

actagatgtt 

ttggaggctg 

tgtgtttgcc 

1500 

tatgttggct 

gctataataa 

gcgtgcctac 

-tgggttcctc 

gtgctagtgc 

tgatattggc 

1560 

tcaggccata 

ctggcattac 

tggtgacaat 

gtggagacct 

tgaatgagga 

tctccttgag 

1620 

atactgagtc 

gtgaacgtgt 

taacattaac 

attgttggcg 

attttcattt 

gaatgaagag 

1680 

gttgccatca 

ttttggcatc 

tttctctgct 

tctacaagtg 

cctttattga 

cactataaag 

1740 

agtcttgatt 

acaagtcttt 

caaaaccatt 

gttgagtcct 

gcggtaacta 

taaagttacc 

1800 

aagggaaage 

ccgtaaaagg 

tgcttggaac 

attggacaac 

agagatcagt 

tttaacacca 

1860 

ctgtgtggtt 

ttccctcaca 

ggctgctggt 

gttatcagat 

caatttttgc 

gcgcacactt 

1920 

gatgcagcaa 

accactcaat 

tcctgatttg 

caaagagcag 

ctgtcaccat 

acttgatggt 

1980 

atttctgaac 

agtcattacg 

tcttgtcgac 

gccatggttt 

atacttcaga 

cctgctcacc 

2040 

aacagtgtca 

ttattatggc 

atatgtaact 

ggtggtcttg 

tacaacagac 

ttctcagtgg 

2100 

ttgtctaatc 

ttttgggcac 

tactgttgaa 

aaactcaggc 

ctatctttga 

atggattgag 

2160 

gcgaaactta 

gtgcaggagt 

tgaatttctc 

aaggatgctt 

gggagattet 

caaatttctc 

2220 

attacaggtg 

tttttgacat 

cgtcaagggt 

caaatacagg 

ttgcttcaga 

taacatcaag 

2280 

gattgtgtaa 

aatgcttcat 

tgatgttgtt 

aacaaggcac 

tcgaaatgtg 

cattgatcaa 

2340 

gtcactatcg 

ctggcgcaaa 

gttgcgatca 

ctcaacttag 

gtgaagtctt 

catcgctcaa 

2400 

agcaagggac 

tttaccgtca 

gtgtatacgt 

ggcaaggagc 

agctgcaact 

actcatgcct 

2460 

cttaaggcac 

caaaagaagt 

aacctttctt 

gaaggtgatt 

cacatgacac 

agtacttacc 

2520 

tctgaggagg 

ttgttctcaa 

gaacggtgaa 

ctcgaagcac 

tcgagacgcc 

cgttgatagc 

2580 

ttcacaaatg 

gagctatcgt 

tggcacacca 

gtctgtgtaa 

atggcctcat 

gctcttagag 

2640 

attaaggaca 

aagaacaata 

ctgcgcattg 

tctcctggtt 

tactggctac 

aaacaatgtc 

2700 

tttcgcttaa 

aagggggtgc 

accaattaaa 

ggtgtaacct 

ttggagaaga 

tactgtttgg 

2760 

gaagttcaag 

gttacaagaa 

tgtgagaatc 

acatttgagc 

ttgatgaacg 

tgttgacaaa 

2820 

gtgcttaatg 

aaaagtgctc 

tgtctacact 

gttgaatccg 

gtaccgaagt 

tactgagttt 

2880 

gcatgtgttg 

tagcagaggc 

tgttgtgaag 

actttacaac 

cagtttctga 

tctccttacc 

2940 

aacatgggta 

ttgatcttga 

tgagtggagt 

gtagctacat 

tctacttatt 

tgatgatgct 

3000 

ggtgaagaaa 

acttttcatc 

acgtatgtat 

tgttcctttt 

accctccaga 

tgaggaagaa 

3060 

gaggacgatg 

cagagtgtga 

ggaagaagaa 

attgatgaaa 

cctgtgaaca 

tgagtacggt 

3120 

acagaggatg 

attatcaagg 

tctccctctg 

gaatttggtg 

cctcagctga 

aacagttcga 

3180 

gttgaggaag 

aagaagagga 

agactggctg 

gatgatacta 

ctgagcaatc 

agagattgag 

3240 

ccagaaccag 

aacctacacc 

tgaagaacca 

gttaatcagt 

ttactggtta 

tttaaaactt 

3300 

actgacaatg 

ttgccattaa 

atgtgttgac 

atcgttaagg 

aggcacaaag 

tgctaatcct 

3360 

atggtgattg 

taaatgctgc 

taacatacac 

ctgaaacatg 

gtggtggtgt 

agcaggtgca 

3420 

ctcaacaagg 

caaccaatgg 

tgccatgcaa 

aaggagagtg 

atgattacat 

taagctaaat 

3480 

ggccctctta 

cagtaggagg 

gtcttgtttg 

ctttctggac 

ataatcttgc 

taagaagtgt 

3540 
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ctgcatgttg 

ttggacctaa 

cctaaatgca 

ggtgaggaca 

tccagcttct 

taaggcagca 

3600 

tatgaaaatt 

tcaattcaca 

ggacatctta 

cttgcaccat 

tgttgtcagc 

aggcatattt 

3660 

ggtgctaaac 

cacttcagtc 

tttacaagtg 

tgcgtgcaga 

cggttcgtac 

acaggtttat 

3720 

attgcagtca 

atgacaaagc 

tctttatgag 

caggttgtca 

tggattatct 

tgataacctg 

3780 

aagcctagag 

tggaagcacc 

taaacaagag 

gagccaccaa 

acacagaaga 

ttccaaaact 

3840 

gaggagaaat 

ctgtcgtaca 

gaagcctgtc 

gatgtgaagc 

caaaaattaa 

ggcctgcatt 

3900 

gatgaggtta 

ccacaacact 

ggaagaaact 

aagtttctta 

ccaataagtt 

actcttgttt 

3960 

gctgatatca 

atggtaagct 

ttaccatgat 

tctcagaaca 

tgcttagagg 

tgaagatatg 

4020 

tctttccttg 

agaaggatgc 

accttacatg 

gtaggtgatg 

ttatcactag 

tggtgatatc 

4080 

acttgtgttg 

taataccctc 

caaaaaggct 

ggtggcacta 

ctgagatgct 

ctcaagagct 

4140 

ttgaagaaag 

tgccagttga 

tgagtatata 

accacgtacc 

ctggacaagg 

atgtgctggt 

4200 

tatacacttg 

aggaagctaa 

gactgctctt 

aagaaatgca 

aatctgcatt 

ttatgtacta 

4260 

ccttcagaag 

cacctaatgc 

taaggaagag 

attctaggaa 

ctgtatcctg 

gaatttgaga 

4320 

gaaatgcttg 

ctcatgctga 

agagacaaga 

aaattaatgc 

ctatatgcat 

ggatgttaga 

4380 

gccataatgg 

caaccatcca 

acgtaagtat 

aaaggaatta 

aaattcaaga 

gggcatcgtt 

4440 

gactatggtg 

tccgattctt 

cttttatact 

agtaaagagc 

ctgtagcttc 

tattattacg 

4500 

aagctgaact 

ctctaaatga 

gccgcttgtc 

acaatgccaa 

ttggttatgt 

gacacatggt 

4560 

tttaatcttg 

aagaggctgc 

gcgctgtatg 

cgttctctta 

aagctcctgc 

cgtagtgtca 

4620 

gtatcatcac 

cagatgctgt 

tactacatat 

aatggatacc 

tcacttcgtc 

atcaaagaca 

4680 

tctgaggagc 

actttgtaga 

aacagtttct 

ttggctggct 

cttacagaga 

ttggtcctat 

4740 

tcaggacagc 

gtacagagtt 

aggtgttgaa 

tttcttaagc 

gtggtgacaa 

aattgtgtac 

4800 

cacactctgg 

agagccccgt 

cgagtttcat 

cttgacggtg 

aggttctttc 

acttgacaaa 

4860 

ctaaagagtc 

tcttatccct 

gcgggaggtt 

aagactataa 

aagtgttcac 

aactgtggac 

4920 

aacactaatc 

tccacacaca 

gcttgtggat 

atgtctatga 

catatggaca 

gcagtttggt 

4980 

ccaacatact 

tggatggtgc 

tgatgttaca 

aaaattaaac 

ctcatgtaaa 

tcatgagggt 

5040 

aagactttct 

ttgtactacc 

tagtgatgac 

acactacgta 

gtgaagcttt 

cgagtactac 

5100 

catactcttg 

atgagagttt 

tcttggtagg 

tacatgtctg 

ctttaaacca 

cacaaagaaa 

5160 

tggaaatttc 

ctcaagttgg 

tggtttaact 

tcaattaaat 

gggctgataa 

caattgttat 

5220 

ttgtctagtg 

ttttattagc 

acttcaacag 

cttgaagtca 

aattcaatgc 

accagcactt 

5280 

caagaggctt 

attatagagc 

ccgtgctggt 

gatgctgcta 

acttttgtgc 

actcatactc 

5340 

gcttacagta 

ataaaactgt 

tggcgagctt 

ggtgatgtca 

gagaaactat 

gacccatctt 

5400 

ctacagcatg 

ctaatttgga 

atctgcaaag 

cgagttctta 

atgtggtgtg 

taaacattgt 

5460 

ggtcagaaaa 

ctactacctt 

aacgggtgta 

gaagctgtga 

tgtatatggg 

tactctatct 

5520 

tatgataatc 

ttaagacagg 

tgtttccatt 

ccatgtgtgt 

gtggtcgtga 

tgctacacaa 

5580 

tatctagtac 

aacaagagtc 

ttcttttgtt 

atgatgtctg 

caccacctgc 

tgagtataaa 

5640 

ttacagcaag 

gtacattctt 

atgtgcgaat 

gagtacactg 

gtaactatca 

gtgtggtcat 

5700 

tacactcata 

taactgctaa 

ggagaccctc 

tatcgtattg 

acggagctca 

ccttacaaag 

5760 

atgtcagagt 

acaaaggacc 

agtgactgat 

gttttctaca 

aggaaacatc 

ttacactaca 

5820 

accatcaagc 

ctgtgtcgta 

taaactcgat 

ggagttactt 

acacagagat 

tgaaccaaaa 

5880 

ttggatgggt 

attataaaaa 

ggataatgct 

tactatacag 

agcagcctat 

agaccttgta 

5940 
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ccaactcaac 

cattaccaaa 

tgcgagtttt 

gataatttca 

aactcacatg 

ttctaacaca 

6000 

aaatttgctg 

atgatttaaa 

tcaaatgaca 

ggcttcacaa 

agccagcttc 

acgagagcta 

6060 

tctgtcacat 

tcttcccaga 

cttgaatggc 

gatgtagtgg 

ctattgacta 

tagacactat 

6120 

tcagcgagtt 

tcaagaaagg 

tgctaaatta 

ctgcataagc 

caattgtttg 

gcacattaac 

6180 

caggctacaa 

ccaagacaac 

gttcaaacca 

aacacttggt 

gtttacgttg 

tctttggagt 

6240 

acaaagccag 

tagatacttc 

aaattcattt 

gaagttctgg 

cagtagaaga 

cacacaagga 

6300 

atggacaatc 

ttgcttgtga 

aagtcaacaa 

cccacctctg 

aagaagtagt 

ggaaaatcct 

6360 

accatacaga 

aggaagtcat 

agagtgtgac 

gtgaaaacta 

ccgaagttgt 

aggcaatgtc 

6420 

atacttaaac 

catcagatga 

aggtgttaaa 

gtaacacaag 

agttaggtca 

tgaggatctt 

6480 

atggctgctt 

atgtggaaaa 

cacaagcatt 

accattaaga 

aacctaatga 

gctttcacta 

6540 

gccttaggtt 

taaaaacaat 

tgccactcat 

ggtattgctg 

caattaatag 

tgttccttgg 

6600 

agtaaaattt 

tggcttatgt 

caaaccattc 

ttaggacaag 

cagcaattac 

aacatcaaat 

6660 

tgcgctaaga 

gattagcaca 

acgtgtgttt 

aacaattata 

tgccttatgt 

gtttacatta 

6720 

ttgttccaat 

tgtgtacttt 

tactaaaagt 

accaattcta 

gaattagagc 

ttcactacct 

6780 

acaactattg 

ctaaaaatag 

tgttaagagt 

gttgctaaat 

tatgtttgga 

tgccggcatt 

6840 

aattatgtga 

agtcacccaa 

attttctaaa 

ttgttcacaa 

tcgctatgtg 

gctattgttg 

6900 

ttaagtattt 

gcttaggttc 

tctaatctgt 

gtaactgctg 

cttttggtgt 

actcttatct 

6960 

aattttggtg 

ctccttctta 

ttgtaatggc 

gttagagaat 

tgtatcttaa 

ttcgtctaac 

7020 

gttactacta 

tggatttctg 

tgaaggttct 

tttccttgca 

gcatttgttt 

aagtggatta 

7080 

gactcccttg 

attcttatcc 

agctcttgaa 

accattcagg 

tgacgatttc 

atcgtacaag 

7140 

ctagacttga 

caattttagg 

tctggccgct 

gagtgggttt 

tggcatatat 

gttgttcaca 

7200 

aaattctttt 

atttattagg 

tctttcagct 

ataatgcagg 

tgttctttgg 

ctattttgct 

7260 

agtcatttca 

tcagcaattc 

ttggctcatg 

tggtttatca 

ttagtattgt 

acaaatggca 

7320 

cccgtttctg 

caatggttag 

gatgtacatc 

ttctttgctt 

ctttctacta 

catatggaag 

7380 

agctatgttc 

atatcatgga 

tggttgcacc 

tcttcgactt 

gcatgatgtg 

ctataagcgc 

7440 

aatcgtgcca 

cacgcgttga 

gtgtacaact 

attgttaatg 

gcatgaagag 

atctttctat 

7500 

gtctatgcaa 

atggaggccg 

tggcttctgc 

aagactcaca 

attggaattg 

tctcaattgt 

7560 

gacacatttt 

gcactggtag 

tacattcatt 

agtgatgaag 

ttgctcgtga 

tttgtcactc 

7620 

cagtttaaaa 

gaccaatcaa 

ccctactgac 

cagtcatcgt 

atattgttga 

tagtgttgct 

7680 

gtgaaaaatg 

gcgcgcttca 

cctctacttt 

gacaaggctg 

gtcaaaagac 

ctatgagaga 

7740 

catccgctct 

cccattttgt 

caatttagac 

aatttgagag 

ctaacaacac 

taaaggttca 

7800 

ctgcctatta 

atgtcatagt 

ttttgatggc 

aagtccaaat 

gcgacgagtc 

tgcttctaag 

7860 

tctgcttctg 

tgtactacag 

tcagctgatg 

tgccaaccta 

ttctgttgct 

tgaccaagtt 

7920 

cttgtatcag 

acgttggaga 

tagtactgaa 

gtttccgtta 

agatgtttga 

tgcttatgtc 

7980 

gacacctttt 

cagcaacttt 

tagtgttcct 

atggaaaaac 

ttaaggcact 

tgttgctaca 

8040 

gctcacagcg 

agttagcaaa 

gggtgtaget 

ttagatggtg 

tcctttctac 

attcgtgtca 

8100 

gctgcccgac 

aaggtgttgt 

tgataccgat 

gttgacacaa 

aggatgttat 

tgaatgtctc 

8160 

aaactttcac 

atcactctga 

cttagaagtg 

acaggtgaca 

gttgtaacaa 

tttcatgctc 

8220 

acctataata 

aggttgaaaa 

catgacgccc 

agagatcttg 

gcgcatgtat 

tgactgtaat 

8280 
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gcaaggcata 

tcaatgccca 

agtagcaaaa 

agtcacaatg 

tttcactcat 

ctggaatgta 

8340 

aaagactaca 

tgtctttatc 

tgaacagctg 

cgtaaacaaa 

ttcgtagtgc 

tgccaagaag 

8400 

aacaacatac 

cttttagact 

aacttgtgct 

acaactagac 

aggttgtcaa 

tgtcataact 

8460 

actaaaatct 

cactcaaggg 

tggtaagatt 

gttagtactt 

gttttaaact 

tatgcttaag 

8520 

gccacattat 

tgtgcgttct 

tgctgcattg 

gtttgttata 

tcgttatgcc 

agtacataca 

8580 

ttgtcaatcc 

atgatggtta 

cacaaatgaa 

atcattggtt 

acaaagccat 

tcaggatggt 

8640 

gtcactcgtg 

acatcatttc 

tactgatgat 

tgttttgcaa 

ataaacatgc 

tggttttgac 

8700 

gcatggttta 

gccagcgtgg 

tggttcatac 

aaaaatgaca 

aaagctgccc 

tgtagtagct 

8760 

gctatcatta 

caagagagat 

tggtttcata 

gtgcctggct 

taccgggtac 

tgtgctgaga 

8820 

gcaatcaatg 

gtgacttctt 

gcattttcta 

cctcgtgttt 

ttagtgctgt 

tggcaacatt 

8880 

tgctacacac 

cttccaaact 

cattgagtat 

agtgattttg 

ctacctctgc 

ttgcgttctt 

8940 

gctgctgagt 

gtacaatttt 

taaggatgct 

atgggcaaac 

ctgtgccata 

ttgttatgac 

9000 

actaatttgc 

tagagggttc 

tatttcttat 

agtgagcttc 

gtccagacac 

tcgttatgtg 

9060 

cttatggatg 

gttccatcat 

acagtttcct 

aacacttacc 

tggagggtte 

tgttagagta 

9120 

gtaacaactt 

ttgatgctga 

gtactgtaga 

catggtacat 

gcgaaaggtc 

agaagtaggt 

9180 

atttgcctat 

ctaccagtgg 

tagatgggtt 

cttaataatg 

agcattacag 

agctctatca 

9240 

ggagttttct 

gtggtgttga 

tgcgatgaat 

ctcatagcta 

acatctttac 

tcctcttgtg 

9300 

caacctgtgg 

gtgctttaga 

tgtgtctgct 

tcagtagtgg 

ctggtggtat 

tattgccata 

9360 

ttggtgactt 

gtgctgccta 

ctactttatg 

aaattcagac 

gtgtttttgg 

tgagtacaac 

9420 

catgttgttg 

ctgctaatgc 

acttttgttt 

ttgatgtctt 

tcactatact 

ctgtctggta 

9480 

ccagcttaca 

gctttctgcc 

gggagtctac 

tcagtctttt 

acttgtactt 

gacattctat 

9540 

ttcaccaatg 

atgtttcatt 

cttggctcac 

cttcaatggt 

ttgccatgtt 

ttctcctatt 

9600 

gtgccttttt 

ggataacagc 

aatctatgta 

ttctgtattt 

ctctgaagca 

ctgccattgg 

9660 

ttctttaaca 

actatcttag 

gaaaagagtc 

atgtttaatg 

gagttacatt 

tagtaccttc 

9720 

gaggaggctg 

ctttgtgtac 

ctttttgctc 

aacaaggaaa 

tgtacctaaa 

attgcgtagc 

9780 

gagacactgt 

tgccacttac 

acagtataac 

aggtatcttg 

ctctatataa 

caagtacaag 

9840 

tatttcagtg 

gagccttaga 

tactaccagc 

tatcgtgaag 

cagcttgctg 

ccacttagca 

9900 

aaggctctaa 

atgactttag 

caactcaggt 

gctgatgttc 

tctaccaacc 

accacagaca 

9960 

tcaatcactt 

ctgctgttct 

gcagagtggt 

tttaggaaaa 

tggcattccc 

gtcaggcaaa 

10020 

gttgaagggt 

gcatggtaca 

agtaacctgt 

ggaactacaa 

ctcttaatgg 

attgtggttg 

10080 

gatgacacag 

tatactgtcc 

aagacatgtc 

atttgcacag 

cagaagacat 

gcttaatcct 

10140 

aactatgaag 

atctgctcat 

tcgcaaatcc 

aaccatagct 

ttcttgttca 

ggctggcaat 

10200 

gttcaacttc 

gtgttattgg 

ccattctatg 

caaaattgtc 

tgcttaggct 

taaagttgat 

10260 

acttctaacc 

ctaagacacc 

caagtataaa 

tttgtccgta 

tccaacctgg 

tcaaacattt 

10320 

tcagttctag 

catgctacaa 

tggttcacca 

tctggtgttt 

atcagtgtgc 

catgagacct 

10380 

aatcatacca 

ttaaaggttc 

tttccttaat 

ggatcatgtg 

gtagtgttgg 

ttttaacatt 

10440 

gattatgatt 

gcgtgtcttt 

ctgctatatg 

catcatatgg 

agcttccaac 

aggagtaeac: 

10500 

gctggtactg 

acttagaagg 

taaattctat 

ggtccatttg 

ttgacagaca 

aactgcacag 

10560 

gctgcaggta 

cagacacaac 

cataacatta 

aatgttttgg 

catggctgta 

tgctgctgtt 

10620 

atcaatggtg 

ataggtggtt 

tcttaataga 

ttcaccacta 

ctttgaatga 

ctttaacctt 

10680 
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gtggcaatga 

agtacaacta 

tgaacctttg 

acacaagatc 

atgttgacat 

attgggacct 

10740 

ctttctgctc 

aaacaggaat 

tgccgtctta 

gatatgtgtg 

ctgctttgaa 

agagctgctg 

10800 

cagaatggta 

tgaatggtcg 

tactatcctt 

ggtagcacta 

ttttagaaga 

tgagtttaca 

10860 

ccatttgatg 

ttgttagaca 

atgctctggt 

gttaccttcc 

aaggtaagtt 

caagaaaatt 

10920 

gttaagggca 

ctcatcattg 

gatgctttta 

actttcttga 

catcactatt 

gattcttgtt 

10980 

caaagtacac 

agtggtcact 

gtttttcttt 

gtttacgaga 

atgctttctt 

gccatttact 

11040 

cttggtatta 

tggcaattgc 

tgcatgtgct 

atgctgcttg 

ttaagcataa 

gcacgcattc 

11100 

ttgtgcttgt 

ttctgttacc 

ttctcttgca 

acagttgctt 

actttaatat 

ggtctacatg 

11160 

cctgctagct 

gggtgatgog 

tatcatgaca 

tggcttgaat 

tggctgacac 

tagcttgtct 

11220 

ggttataggc 

ttaaggattg 

tgttatgtat 

gcttcagctt 

tagttttgct 

tattctcatg 

11280 

acagctcgca 

ctgtttatga 

tgatgctgct 

agacgtgttt 

ggacactgat 

gaatgtcatt 

11340 

acacttgttt 

acaaagtcta 

ctatggtaat 

gctttagatc 

aagctatttc 

catgtgggcc 

11400 

ttagttattt 

ctgtaacctc 

taactattct 

ggtgtcgtta 

cgactatcat 

gtttttagct 

11460 

agagctatag 

tgtttgtgtg 

tgttgagtat 

tacccattgt 

tatttattac 

tggcaacacc 

11520 

ttacagtgta 

tcatgcttgt 

ttattgtttc 

ttaggctatt 

gttgctgctg 

ctactttggc 

11580 

cttttctgtt 

tactcaaccg 

ttacttcagg 

cttactcttg 

gtgtttatga 

ctacttggtc 

11640 

tctacacaag 

aatttaggta 

tatgaactcc 

caggggcttt 

tgcctcctaa 

gagtagtatt 

11700 

gatgctttca 

agcttaacat 

taagttgttg 

ggtattggag 

gtaaaccatg 

tatcaaggtt 

11760 

gctactgtac 

agtctaaaat 

gtctgacgta 

aagtgcacat 

ctgtggtact 

gctctcggtt 

11820 

cttcaacaac 

ttagagtaga 

gtcatcttct 

aaattgtggg 

cacaatgtgt 

acaactccac 

11880 

aatgatattc 

ttcttgcaaa 

agacacaact 

gaagctttcg 

agaagatggt 

ttctcttttg 

11940 

tctgttttgc 

tatccatgca 

gggtgctgta 

gacattaata 

ggttgtgcga 

ggaaatgctc 

12000 

gataaccgtg 

ctactcttca 

ggctattgct 

tcagaattta 

gttctttacc 

atcatatgcc 

12060 

gcttatgcca 

ctgcccagga 

ggcctatgag 

caggctgtag 

ctaatggtga 

ttctgaagtc 

12120 

gttctcaaaa 

agttaaagaa 

atctttgaat 

gtggctaaat 

ctgagtttga 

ccgtgatgct 

12180 

gccatgcaac 

gcaagttgga 

aaagatggca 

gatcaggcta 

tgacccaaat 

gtacaaacag 

12240 

gcaagatctg 

aggacaagag 

ggcaaaagta 

actagtgcta 

tgcaaacaat 

gctcttcact 

12300 

atgcttagga 

agcttgataa 

tgatgcactt 

aacaacatta 

tcaacaatgc 

gcgtgatggt 

12360 

tgtgttccac 

tcaacatcat 

accattgact 

acagcagcca 

aactcatggt 

tgttgtccct 

12420 

gattatggta 

cctacaagaa 

cacttgtgat 

ggtaacacct 

ttacatatgc 

atctgcactc 

12480 

tgggaaatcc 

agcaagttgt 

tgatgcggat 

agcaagattg 

ttcaacttag 

tgaaattaac 

12540 

atggacaatt 

caccaaattt 

ggcttggcct 

cttattgtta 

cagctctaag 

agccaactca 

12600 

gctgttaaac 

tacagaataa 

tgaactgagt 

ccagtagcac 

tacgacagat 

gtcctgtgcg 

12660 

gctggtacca 

cacaaacagc 

ttgtactgat 

gacaatgcac 

ttgcctacta 

taacaattcg 

12720 

aagggaggta 

ggtttgtgct 

ggcattacta 

tcagaccacc 

aagatctcaa 

atgggetaga 

12780 

ttccctaaga 

gtgatggtac 

aggtacaatt 

tacacagaac 

tggaaccacc 

ttgtaggttt 

12840 

gttacagaca 

caccaaaagg 

gcctaaagtg 

aaatacttgt 

acttcatcaa 

aggcttaaac 

12900 

aacctaaata 

gaggtatggt 

gctgggcagt 

ttagctgcta 

cagtacgtct 

tcaggctgga 

12960 

aatgctacag 

aagtacctgc 

caattcaact 

gtgctttcct 

tctgtgcttt 

tgcagtagac 

13020 
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cctgctaaag 

catataagga 

ttacctagca 

agtggaggac 

aaccaatcac 

caactgtgtg 

13080 

aagatgttgt 

gtacacacac 

tggtacagga 

caggcaatta 

ctgtaacacc 

agaagctaac 

13140 

atggaccaag 

agtcctttgg 

tggtgGttea 

tgttgtctgt 

attgtagatg 

ccacattgac 

13200 

catccaaatc 

ctaaaggatt 

ctgtgacttg 

aaaggtaagt 

acgtccaaat 

acctaccact 

13260 

tgtgctaatg 

acccagtggg 

ttttacactt 

agaaacacag 

tctgtaccgt 

ctgcggaatg 

13320 

tggaaaggtt 

atggctgtag 

ttgtgaccaa 

ctccgcgaac 

ccttgatgca 

gtctgcggat 

13380 

gcatcaacgt 

ttttaaacgg 

gtttgcggtg 

taagtgcagc 

ccgtcttaca 

ccgtgcggca 

13440 

caggcactag 

tactgatgtc 

gtctacaggg 

cttttgatat 

ttacaacgaa 

aaagttgctg 

13500 

gttttgcaaa 

gttcctaaaa 

actaattgct 

gtcgcttcca 

ggagaaggat 

gaggaaggca 

13560 

atttattaga 

ctcttacttt 

gtagttaaga 

ggcatactat 

gtctaactac 

caacatgaag 

13620 

agactattta 

taacttggtt 

aaagattgtc 

cagcggttgc 

tgtccatgac 

tttttcaagt 

13680 

ttagagtaga 

tggtgaeatg 

gtaccacata 

tatcacgtca 

gcgtctaact 

aaatacacaa 

13740 

tggctgattt 

agtctatgct 

ctacgtcatt 

ttgatgaggg 

taattgtgat 

acattaaaag 

13800 

aaatactcgt 

cacatacaat 

tgctgtgatg 

atgattattt 

caataagaag 

gattggtatg 

13860 

acttcgtaga 

gaatcctgac 

atcttacgcg 

tatatgctaa 

cttaggtgag 

cgtgtacgcc 

13920 

aatcattatt 

aaagactgta 

caattctgcg 

atgctatgcg 

tgatgcaggc 

attgtaggcg 

13980 

tactgacatt 

agataatcag 

gatcttaatg 

ggaactggta 

cgatttcggt 

gatttcgtac 

14040 

aagtagcacc 

aggctgcgga 

gttcctattg 

tggattcata 

ttactcattg 

ctgatgccca 

14100 

tcctcacttt 

gactagggca 

ttggctgctg 

agtcccatat 

ggatgctgat 

ctcgcaaaac 

14160 

cacttattaa 

gtgggatttg 

ctgaaatatg 

attttacgga 

agagagactt 

tgtctcttcg 

14220 

accgttattt 

taaatattgg 

gaccagacat 

accatcccaa 

ttgtattaac 

tgtttggatg 

14280 

ataggtgtat 

ccttcattgt 

gcaaacttta 

atgtgttatt 

ttctactgtg 

tttccaccta 

14340 

caagttttgg 

accactagta 

agaaaaatat 

ttgtagatgg 

tgttcctttt 

gttgtttcaa 

14400 

ctggatacca 

ttttcgtgag 

ttaggagtcg 

tacataatca 

ggatgtaaac 

ttacatagct 

14460 

cgcgtctcag 

tttcaaggaa 

cttttagtgt 

atgctgctga 

tccagctatg 

catgcagctt 

14520 

ctggcaattt 

attgctagat 

aaacgcacta 

catgcttttc 

agtagctgca 

ctaacaaaca 

14580 

atgttgcttt 

tcaaactgtc 

aaacccggta 

attttaataa 

agacttttat 

gactttgctg 

14640 

tgtctaaagg 

tttctttaag 

gaaggaagtt 

ctgttgaact 

aaaacacttc 

ttctttgctc 

14700 

aggatggcaa 

cgctgctatc 

agtgattatg 

actattatcg 

ttataatctg 

ccaacaatgt 

14760 

gtgatatcag 

acaactccta 

ttcgtagttg 

aagttgttga 

taaatacttt 

gattgttacg 

14820 

atggtggctg 

tattaatgcc 

aaccaagtaa 

tcgttaacaa 

tctggataaa 

tcagctggtt 

14880 

tcccatttaa 

taaatggggt 

aaggctagac 

tttattatga 

ctcaatgagt 

tatgaggatc 

14940 

aagatgcact 

tttcgcgtat 

actaagcgta 

atgtcatccc 

tactataact 

caaatgaatc 

15000 

ttaagtatgc 

cattagtgca 

aagaatagag 

ctcgcaccgt 

agctggtgtc 

tctatctgta 

15060 

gtactatgac 

aaatagacag 

tttcatcaga 

aattattgaa 

gtcaatagcc 

gccactagag 

15120 

gagctactgt 

ggtaattgga 

acaagcaagt 

tttacggtgg 

ctggcataat 

atgttaaaaa 

15180 

ctgtttacag 

tgatgtagaa 

actccacacc 

ttatgggttg 

ggattatcca 

aaatgtgaca 

15240 

gagccatgcc 

taacatgctt 

aggataatgg 

cctctcttgt 

tcttgctcgc 

aaacataaca 

15300 

cttgctgtaa 

cttatcacac 

cgtttctaca 

ggttagctaa 

cgagtgtgcg 

caagtattaa 

15360 

gtgagatggt 

catgtgtggc 

ggctcactat 

atgttaaacc 

aggtggaaea 

tcatccggtg 

15420 
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atgctacaac 

tgcttatgct 

aatagtgtct 

ttaacatttg 

tcaagctgtt 

acagccaatg 

15480 

taaatgcact 

tctttcaact 

gatggtaata 

agatagctga 

caagtatgtc 

cgcaatctac 

15540 

aacacaggct 

ctatgagtgt 

ctctatagaa 

atagggatgt 

tgatcatgaa 

ttcgtggatg 

15600 

agttttacgc 

ttacctgcgt 

aaacatttct 

ccatgatgat 

tctttctgat 

gatgccgttg 

15660 

tgtgctataa 

cagtaactat 

gcggctcaag 

gtttagtagc 

tagcattaag 

aactttaagg 

15720 

cagttcttta 

ttatcaaaat 

aatgtgttca 

tgtctgaggc 

aaaatgttgg 

actgagactg 

15780 

accttactaa 

aggacctcac 

gaattttgct 

cacagcatac 

aatgctagtt 

aaacaaggag 

15840 

atgattacgt 

gtacctgcct 

tacccagatc 

catcaagaat 

attaggcgca 

ggctgttttg 

15900 

tcgatgatat 

tgtcaaaaca 

gatggtacac 

ttatgattga 

aaggttcgtg 

tcactggcta 

15960 

ttgatgctta 

cccacttaca 

aaacatccta 

atcaggagta 

tgctgatgtc 

tttcacttgt 

16020 

atttacaata 

cattagaaag 

ttacatgatg 

agcttactgg 

ccacatgttg 

gacatgtatt 

16080 

ccgtaatgct 

aactaatgat 

aacacctcac 

ggtactggga 

acctgagttt 

tatgaggcta 

16140 

tgtacacacc 

acatacagtc 

ttgcaggctg 

taggtgcttg 

tgtattgtgc 

aattcacaga 

16200 

cttcacttcg 

ttgcggtgcc 

tgtattagga 

gaccattcct 

atgttgcaag 

tgctgctatg 

16260 

accatgtcat 

ttcaacatca 

cacaaattag 

tgttgtctgt 

taatccctat 

gtttgcaatg 

16320 

ccccaggttg 

tgatgtcact 

gatgtgacac 

aactgtatct 

aggaggtatg 

agctattatt 

16380 

gcaagtcaca 

taagcctccc 

attagttttc 

cattatgtgc 

taatggtcag 

gtttttggtt 

16440 

tatacaaaaa 

cacatgtgta 

ggcagtgaca 

atgtcactga 

cttcaatgcg 

atagcaacat 

16500 

gtgattggac 

taatgctggc 

gattacatac 

ttgccaacac 

ttgtactgag 

agactcaagc 

16560 

ttttcgcagc 

agaaacgctc 

aaagccactg 

aggaaacatt 

taagctgtca 

tatggtattg 

16620 

ctactgtacg 

cgaagtactc 

tctgacagag 

aattgcatct 

ttcatgggag 

gttggaaaac 

16680 

ctagaccacc 

attgaacaga 

aactatgtct 

ttactggtta 

ccgtgtaact 

aaaaatagta 

16740 

aagtacagat 

tggagagtac 

acctttgaaa 

aaggtgacta 

tggtgatget 

gttgtgtaca 

16800 

gaggtactac 

gacatacaag 

ttgaatgttg 

gtgattactt 

tgtgttgaca 

tctcacactg 

16860 

taatgccact 

tagtgcacct 

actctagtgc 

cacaagagca 

ctatgtgaga 

attactggct 

16920 

tgtacccaac 

actcaacatc 

tcagatgagt 

tttctagcaa 

tgttgcaaat 

tatcaaaagg 

16980 

tcggcatgca 

aaagtactct 

acactccaag 

gaccacctgg 

tactggtaag 

agtcattttg 

17040 

ccatcggact 

tgctctctat 

tacccatctg 

ctcgcatagt 

gtatacggca 

tgctctcatg 

17100 

cagctgttga 

tgccctatgt 

gaaaaggcat 

taaaatattt 

gcccatagat 

aaatgtagta 

17160 

gaatcatacc 

tgcgcgtgcg 

cgcgtagagt 

gttttgataa 

attcaaagtg 

aattcaacac 

17220 

tagaacagta 

tgttttctgc 

actgtaaatg 

cattgccaga 

aacaactgct 

gacattgtag 

17280 

tctttgatga 

aatctctatg 

gctactaatt 

atgacttgag 

tgttgtcaat 

gctagacttc 

17340 

gtgcaaaaca 

ctacgtctat 

attggcgatc 

ctgctcaatt 

accagccccc 

cgcacattgc 

17400 

tgactaaagg 

cacactagaa 

ccagaatatt 

ttaattcagt 

gtgcagactt 

atgaaaacaa 

17460 

taggtccaga 

catgttcctt 

ggaacttgtc 

gccgttgtcc 

tgctgaaatt 

gttgacactg 

17520 

tgagtgcttt 

agtttatgac 

aataagctaa 

aagcacacaa 

ggataagtca 

gctcaatgct 

17580 

tcaaaatgtt 

ctacaaaggt 

gttattacac 

atgatgtttc 

atctgcaatc 

aacagacctc 

17640 

aaataggcgt 

tgtaagagaa 

tttcttacac 

gcaatcctgc 

ttggagaaaa 

gctgtt.tt.ta 

17700 

tctcacctta 

taattcacag 

aacgctgtag 

cttcaaaaat 

cttaggattg 

cctacgcaga 

17760 
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ctgttgatte 

atcacagggt 

tctgaatatg 

actatgtcat 

attcacacaa 

actactgaaa 

17820 

cagcacactc 

ttgtaatgtc 

aaccgcttca 

atgtggctat 

cacaagggca 

aaaattggca 

17880 

ttttgtgcat 

aatgtctgat 

agagatcttt 

atgacaaact 

gcaatttaca 

agtctagaaa 

17940 

taccacgtcg 

caatgtggct 

acattacaag 

cagaaaatgt 

aactggactt 

tttaaggact 

18000 

gtagtaagat 

cattactggt 

cttcatccta 

cacaggcacc 

tacacacctc 

agcgttgata 

18060 

taaagttcaa 

gactgaagga 

ttatgtgttg 

acataccagg 

cataccaaag 

gacatgacct 

18120 

accgtagact 

catctctatg 

atgggtttca 

aaatgaatta 

ccaagtcaat 

ggttacccta 

18180 

atatgtttat 

cacccgcgaa 

gaagctattc 

gtcacgttcg 

tgcgtggatt 

ggctttgatg 

18240 

tagagggctg 

tcatgcaact 

agagatgctg 

tgggtactaa 

cctacctctc 

cagctaggat 

18300 

tttctacagg 

tgttaactta 

gtagctgtac 

cgactggtta 

tgttgacact 

gaaaataaca 

18360 

cagaattcac 

cagagttaat 

gcaaaacctc 

caccaggtga 

ccagtttaaa 

catcttatac 

18420 

cactcatgta 

taaaggcttg 

ccctggaatg 

tagtgcgtat 

taagatagta 

caaatgctca 

18480 

gtgatacact 

gaaaggattg 

tcagacagag 

tcgtgttcgt 

cctttgggcg 

catggctttg 

18540 

agcttacatc 

aatgaagtac 

tttgtcaaga 

ttggacctga 

aagaacgtgt 

tgtctgtgtg 

18600 

acaaacgtgc 

aacttgcttt 

tctacttcat 

cagatactta 

tgcctgctgg 

aatcattctg 

18660 

tgggttttga 

ctatgtctat 

aacccattta 

tgattgatgt 

tcagcagtgg 

ggctttacgg 

18720 

gtaaccttca 

gagtaaccat 

gaccaacatt 

gccaggtaca 

tggaaatgca 

catgtggcta 

18780 

gttgtgatgc 

tatcatgact 

agatgtttag 

cagtccatga 

gtgctttgtt 

aagcgcgttg 

18840 

attggtctgt 

tgaataccct 

attataggag 

atgaactgag 

ggttaattct 

gcttgcagaa 

18900 

aagtacaaca 

catggttgtg 

aagtctgcat 

tgcttgctga 

taagtttcca 

gttcttcatg 

18960 

acattggaaa 

tccaaaggct 

atcaagtgtg 

tgcctcaggc 

tgaagtagaa 

tggaagttct 

19020 

acgatgctca 

gccatgtagt 

gacaaagctt 

acaaaataga 

ggagctcttc 

tattcttatg 

19080 

ctacacatca 

cgataaattc 

actgatggtg 

tttgtttgtt 

ttggaattgt 

aacgttgatc 

19140 

gttacccagc 

caatgcaatt 

gtgtgtaggt 

ttgacacaag 

agccttgtca 

aacttgaact 

19200 

taccaggctg 

tgatggtggt 

agtttgtatg 

tgaataagca 

tgcattccac 

actccagctt 

19260 

tcgataaaag 

tgcatttact 

aatttaaagc 

aattgccttt 

cttttactat 

tctgatagtc 

19320 

cttgtgagtc 

tcatggcaaa 

caagtagtgt 

cggatattga 

ttatgttcca 

ctcaaatctg 

19380 

ctacgtgtat 

tacacgatgc 

aatttaggtg 

gtgctgtttg 

cagacaccat 

gcaaatgagt 

19440 

accgacagta 

cttggatgca 

tataatatga 

tgatttctgc 

tggatttagc 

ctatggattt 

19500 

acaaacaatt 

tgatacttat 

aacctgtgga 

atacatttac 

caggttacag 

agtttagaaa 

19560 

atgtggctta 

taatgttgtt 

aataaaggac 

actttgatgg 

acacgccggc 

gaagcacctg 

19620 

tttccatcat 

taataatgct 

gtttacacaa 

aggtagatgg 

tattgatgtg 

gagatctttg 

19680 

aaaataagac 

aacacttcct 

gttaatgttg 

catttgagct 

ttgggctaag 

cgtaacatta 

19740 

aaccagtgcc 

agagattaag 

atactcaata 

atttgggtgt 

tgatatcgct 

gctaatactg 

19800 

taatctggga 

ctacaaaaga 

gaagccccag 

cacatgtatc 

tacaataggt 

gtctgcacaa 

19860 

tgactgacat 

tgccaagaaa 

cctactgaga 

gtgcttgttc 

ttcacttact 

gtcttgtttg 

19920 

atggtagagt 

ggaaggacag 

gtagaccttt 

ttagaaacgc 

ccgtaatggt 

gttttaataa 

19980 

cagaaggttc 

agtcaaaggt 

ctaacacctt 

caaagggacc 

agcacaagct 

agcgtcaatg 

20040 

gagtcacatt 

aattggagaa 

tcagtaaaaa 

cacagtttaa 

ctactttaag 

aaagtagacg 

20100 

gcattattca 

acagttgcct 

gaaacctact 

ttactcagag 

cagagactta 

gaggatttta 

20160 
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agcccagatc 

acaaatggaa 

actgactttc 

tcgagctcgc 

tatggatgaa 

ttcatacagc 

20220 

gatataagct 

cgagggctat 

gccttcgaac 

acatcgttta 

tggagatttc 

agtcatggac 

20280 

aacttggcgg 

tcttcattta 

atgataggct 

tagccaagcg 

ctcacaagat 

tcaccactta 

20340 

aattagagga 

ttttatccct 

atggacagca 

cagtgaaaaa 

ttacttcata 

acagatgcgc 

20400 

aaacaggttc 

atcaaaatgt 

gtgtgttctg 

tgattgatct 

tttacttgat 

gactttgtcg 

20460 

agataataaa 

gtcacaagat 

ttgtcagtga 

tttcaaaagt 

ggtcaaggtt 

acaattgact 

20520 

atgctgaaat 

ttcattcatg 

ctttggtgta 

aggatggaca 

tgttgaaacc 

ttctacccaa 

20580 

aactacaagc 

aagtcaagcg 

tggcaaccag 

gtgttgcgat 

gcctaacttg 

tacaagatgc 

20640 

aaagaatgct 

tcttgaaaag 

tgtgaccttc 

agaattatgg 

tgaaaatgct 

gttataccaa 

20700 

aaggaataat 

gatgaatgtc 

gcaaagtata 

ctcaactgtg 

tcaatactta 

aatacactta 

20760 

ctttagctgt 

accctacaac 

atgagagtta 

ttcactttgg 

tgctggctct 

gataaaggag 

20820 

ttgcaccagg 

tacagctgtg 

ctcagacaat 

ggttgccaac 

tggcacacta 

cttgtcgatt 

20880 

cagatcttaa 

tgacttcgtc 

tccgacgcag 

attctacttt 

aattggagac 

tgtgcaacag 

20940 

tacatacggc 

taataaatgg 

gaccttatta 

ttagcgatat 

gtatgaccct 

aggaccaaac 

21000 

atgtgacaaa 

agagaatgac 

tctaaagaag 

ggtttttcac 

ttatctgtgt 

ggatttataa 

21060 

agcaaaaact 

agccctgggt 

ggttctatag 

ctgtaaagat 

aacagagcat 

tcttggaatg 

21120 

ctgaccttta 

caagcttatg 

ggccatttct 

catggtggac 

agcttttgtt 

acaaatgtaa 

21180 

atgcatcatc 

atcggaagca 

tttttaattg 

gggctaacta 

tcttggcaag 

ccgaaggaac 

21240 

aaattgatgg 

ctataccatg 

catgctaact 

acattttctg 

gaggaacaca 

aatcctatcc 

21300 

agttgtcttc 

ctattcactc 

tttgacatga 

gcaaatttcc 

tcttaaatta 

agaggaaetg 

21360 

ctgtaatgtc 

tcttaaggag 

aatcaaatca 

atgatatgat 

ttattctctt 

ctggaaaaag 

21420 

gtaggcttat 

cattagagaa 

aacaacagag 

ttgtggtttc 

aagtgatatt 

cttgttaaca 

21480 

actaaacgaa 

catgtttatt 

ttcttattat 

ttcttactct 

cactagtggt 

agtgaccttg 

21540 

accggtgcac 

cacttttgat 

gatgttcaag 

ctcctaatta 

cactcaacat 

acttcatcta 

21600 

tgaggggggt 

ttactatcct 

gatgaaattt 

ttagatcaga 

cactctttat 

ttaactcagg 

21660 

atttatttct 

tccattttat 

tctaatgtta 

cagggtttca 

tactattaat 

catacgtttg 

21720 

gcaaccctgt 

catacctttt 

aaggatggta 

tttattttgc 

tgccacagag 

aaatcaaatg 

21780 

ttgtccgtgg 

ttgggttttt 

ggttctacca 

tgaacaacaa 

gtcacagtcg 

gtgattatta 

21840 

ttaacaattc 

tactaatgtt 

gttatacgag 

catgtaactt 

tgaattgtgt 

gacaaccctt 

21900 

tctttgctgt 

ttctaaaccc 

atgggtacac 

agacacatac 

tatgatattc 

gataatgcat 

21960 

ttaattgcac 

tttcgagtac 

atatctgatg 

ccttttcgct 

tgatgtttca 

gaaaagtcag 

22020 

gtaattttaa 

acacttacga 

gagtttgtgt 

ttaaaaataa 

agatgggttt 

ctctatgttt 

22080 

ataagggcta 

tcaacctata 

gatgtagttc 

gtgatctacc 

ttctggtttt 

aacactttga 

22140 

aacctatttt 

taagttgcct 

cttggtatta 

acattacaaa 

ttttagagcc 

attcttacag 

22200 

ccttttcacc 

tgctcaagac 

atttggggca 

cgtcagctgc 

agcctatttt 

gttggctatt 

22260 

taaagccaac 

tacatttatg 

ctcaagtatg 

atgaaaatgg 

tacaatcaca 

gatgctgttg 

22320 

attgttctca 

aaatccactt 

gctgaactca 

aatgctctgt 

taagagcttt 

gagattgaca 

22380 

aaggaattta 

ccagacctct 

aatttcaggg 

ttgttccctc 

aggagatgtt 

gtgagattcc 

22440 

ctaatattac 

aaacttgtgt 

ccttttggag 

aggtttttaa 

tgctactaaa 

ttcccttctg 

22500 
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tctatgcatg 

ggagagaaaa 

aaaatttcta 

attgtgttgc 

tgattactct 

gtgctctaca 

22560 

actcaacatt 

tttttcaacc 

tttaagtgct 

atggcgtttc 

tgccactaag 

ttgaatgatc 

22620 

tttgcttctc 

caatgtctat 

gcagattctt 

ttgtagtcaa 

gggagatgat 

gtaagacaaa 

22680 

tagcgccagg 

acaaactggt 

gttattgctg 

attataatta 

taaattgcca 

gatgatttca 

22740 

tgggttgtgt 

ccttgcttgg 

aatactagga 

acattgatgc 

tacttcaact 

ggtaattata 

22800 

attataaata 

taggtatctt 

agacatggca 

agcttaggcc 

ctttgagaga 

gacatatcta 

22860 

atgtgccttt 

ctcccctgat 

ggcaaacctt 

gcaccccacc 

tgctcttaat 

tgttattggc 

22920 

cattaaatga 

ttatggtttt 

tacaccacta 

ctggcattgg 

ctaccaacct 

tacagagttg 

22980 

tagtactttc 

ttttgaactt 

ttaaatgcac 

cggccacggt 

ttgtggacca 

aaattatcca 

23040 

ctgaccttat 

taagaaccag 

tgtgtcaatt 

ttaattttaa 

tggactcact 

ggtactggtg 

23100 

tgttaactcc 

ttcttcaaag 

agatttcaac 

catttcaaca 

atttggccgt 

gatgtttctg 

23160 

atttcactga 

ttccgttcga 

gatcctaaaa 

catctgaaat 

attagacatt 

tcaccttgct 

23220 

cttttggggg 

tgtaagtgta 

attacacctg 

gaacaaatgc 

ttcatctgaa 

gttgctgttc 

23280 

tatatcaaga 

tgttaactgc 

actgatgttt 

ctacagcaat 

tcatgcagat 

caactcacac 

23340 

cagcttggcg 

catatattct 

actggaaaca 

atgtattcca 

gactcaagca 

ggctgtctta 

23400 

taggagctga 

gcatgtcgac 

acttcttatg 

agtgcgacat 

tcctattgga 

gctggcattt 

23460 

gtgctagtta 

ccatacagtt 

tctttattac 

gtagtactag 

ccaaaaatct 

attgtggctt 

23520 

atactatgtc 

tttaggtgct 

gatagttcaa 

ttgcttactc 

taataacacc 

attgctatac 

23580 

ctactaactt 

ttcaattagc 

attactacag 

aagtaatgcc 

tgtttctatg 

gctaaaacct 

23640 

ccgtagattg 

taatatgtac 

atctgcggag 

attctactga 

atgtgctaat 

ttgcttctcc 

23700 

aatatggtag 

cttttgcaca 

caactaaatc 

gtgcactctc 

aggtattgct 

gctgaacagg 

23760 

atcgcaacac 

acgtgaagtg 

ttcgctcaag 

tcaaacaaat 

gtacaaaacc 

ccaactttga 

23820 

aatattttgg 

tggttttaat 

ttttcacaaa 

tattacctga 

ccctctaaag 

ccaactaaga 

23880 

ggtcttttat 

tgaggacttg 

ctctttaata 

aggtgacact 

cgctgatgct 

ggcttcatga 

23940 

agcaatatgg 

cgaatgccta 

ggtgatatta 

atgctagaga 

tctcatttgt 

gcgcagaagt 

24000 

tcaatggact 

tacagtgttg 

ccacctctgc 

tcactgatga 

tatgattgct 

gcctacactg 

24060 

ctgctctagt 

tagtggtact 

gccactgctg 

gatggacatt 

tggtgctggc 

gctgctcttc 

24120 

aaataccttt 

tgctatgcaa 

atggcatata 

ggttcaatgg 

cattggagtt 

acccaaaatg 

24180 

ttctctatga 

gaaccaaaaa 

caaatcgcca 

accaatttaa 

caaggcgatt 

agtcaaattc 

24240 

aagaatcact 

tacaacaaca 

tcaactgcat 

tgggcaagct 

gcaagacgtt 

gttaaccaga 

24300 

atgctcaagc 

attaaacaca 

cttgttaaac 

aacttagctc 

taattttggt 

gcaatttcaa 

24360 

gtgtgctaaa 

tgatatcctt 

tcgcgacttg 

ataaagtcga 

ggcggaggta 

caaattgaca 

24420 

ggttaattac 

aggcagactt 

caaagccttc 

aaacctatgt 

aacacaacaa 

ctaatcaggg 

24480 

ctgctgaaat 

cagggcttct 

gctaatcttg 

ctgctactaa 

aatgtctgag 

tgtgtt.ct.tg 

24540 

gacaatcaaa 

aagagttgac 

ttttgtggaa 

agggctacca 

ccttatgtcc 

ttcccacaag 

24600 

cagccccgca 

tggtgttgtc 

ttcctacatg 

tcacgtatgt 

gccatcccag 

gagaggaact 

24660 

tcaccacagc 

gccagcaatt 

tgtcatgaag 

gcaaagcata 

cttccctcgt 

gaaggtgttt 

24720 

ttgtgtttaa 

tggcacttct 

tggtttatta 

cacagaggaa 

cttcttttct 

ccacaaataa 

24780 

ttactacaga 

caatacattt 

gtctcaggaa 

attgtgatgt 

cgttattggc 

atcattaaca 

24840 

acacagttta 

tgatcctctg 

caacctgagc 

tcgactcatt 

caaagaagag 

ctggacaagt 

24900 
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acttcaaaaa 

tcatacatca 

ccagatgttg 

atcttggcga 

catttcaggc 

attaacgctt 

24960 

ctgtcgtcaa 

cattcaaaaa 

gaaattgacc 

gcctcaatga 

ggtcgctaaa 

aatttaaatg 

25020 

aatcactcat 

tgaccttcaa 

gaattgggaa 

aatatgagca 

atatattaaa 

tggccttggt 

25080 

atgtttggct 

cggcttcatt 

gctggactaa 

ttgccatcgt 

catggttaca 

atcttgcttt 

25140 

gttgcatgac 

tagttgttgc 

agttgcctca 

agggtgcatg 

ctcttgtggt 

tcttgctgca 

25200 

agtttgatga 

ggatgactct 

gagccagttc 

tcaagggtgt 

caaattacat 

tacacataaa 

25260 

cgaacttatg 

gatttgttta 

tgagattttt 

tactcttgga 

tcaattactg 

cacagccagt 

25320 

aaaaattgac 

aatgcttctc 

ctgcaagtac 

tgttcatgct 

acagcaacga 

taccgctaca 

25380 

agcctcactc 

cctttcggat 

ggcttgttat 

tggcgttgca 

tttcttgctg 

tttttcagag 

25440 

cgctaccaaa 

ataattgcgc 

tcaataaaag 

atggcagcta 

gccctttata 

agggcttcca 

25500 

gttcatttgc 

aatttactgc 

tgctatttgt 

taccatctat 

tcacatcttt 

tgcttgtcgc 

25560 

tgcaggtatg 

gaggcgcaat 

ttttgtacct 

ctatgccttg 

atatattttc 

tacaatgcat 

25620 

caacgcatgt 

agaattatta 

tgagatgttg 

gctttgttgg 

aagtgcaaat 

ccaagaaccc 

25680 

attactttat 

gatgccaact 

actttgtttg 

ctggcacaca 

cataactatg 

actactgtat 

25740 

accatataac 

agtgtcacag 

atacaattgt 

cgttactgaa 

ggtgacggca 

tttcaacacc 

25800 

aaaactcaaa 

gaagactacc 

aaattggtgg 

ttattctgag 

gataggcact 

caggtgttaa 

25860 

agactatgtc 

gttgtacatg 

gctatttcac 

cgaagtttac 

taccagcttg 

agtctacaca 

25920 

aattactaca 

gacactggta 

ttgaaaatgc 

tacattcttc 

atctttaaca 

agcttgttaa 

25980 

agacccaccg 

aatgtgcaaa 

tacacacaat 

cgacggctct 

tcaggagttg 

ctaatccagc 

26040 

aatggatcca 

atttatgatg 

agccgacgac 

gactactagc 

gtgcctttgt 

aagcacaaga 

26100 

aagtgagtac 

gaacttatgt 

actcattcgt 

ttcggaagaa 

acaggtacgt 

taatagttaa 

26160 

tagcgtactt 

ctttttcttg 

ctttcgtggt 

attcttgcta 

gtcacactag 

ccatccttac 

26220 

tgcgcttcga 

ttgtgtgcgt 

actgctgcaa 

tattgttaac 

gtgagtttag 

taaaaccaac 

26280 

ggtttacgtc 

tactcgcgtg 

ttaaaaatct 

gaactcttct 

gaaggagttc 

ctgatcttct 

26340 

ggtctaaacg 

aactaactat 

tattattatt 

ctgtttggaa 

ctttaacatt 

gcttatcatg 

26400 

gcagacaacg 

gtactattac 

cgttgaggag 

cttaaacaac 

tcctggaaca 

atggaaccta 

26460 

gtaataggtt 

tcctattcct 

agcctggatt 

atgttactac 

aatttgccta 

ttctaatcgg 

26520 

aacaggtttt 

tgtacataat 

aaagcttgtt 

ttcctctggc 

tcttgtggcc 

agtaacactt 

26580 

gcttgttttg 

tgcttgctgc 

tgtctacaga 

attaattggg 

tgactggcgg 

gattgcgatt 

26640 

gcaatggctt 

gtattgtagg 

cttgatgtgg 

cttagctact 

tcgttgcttc 

cttcaggctg 

26700 

tttgctcgta 

cccgctcaat 

gtggtcattc 

aacccagaaa 

caaacattct 

tctcaatgtg 

26760 

cctctccggg 

ggacaattgt 

gaccagaccg 

ctcatggaaa 

gtgaacttgt 

cattggtgct 

26820 

gtgatcattc 

gtggtcactt 

gcgaatggcc 

ggacaccccc 

tagggcgctg 

tgacattaag 

26880 

gacctgccaa 

aagagatcac 

tgtggctaca 

tcacgaacgc 

tttcttatta 

caaattagga 

26940 

gcgtcgcagc 

gtgtaggcac 

tgattcaggt 

tttgctgcat 

acaaccgcta 

ccgtattgga 

27000 

aactataaat 

taaatacaga 

ccacgccggt 

agcaacgaca 

atattgcttt 

gctagtacag 

27060 

taagtgacaa 

cagatgtttc 

atcttgttga 

cttccaggtt 

acaatagcag 

agatattgat 

27120 

tatcattatg 

aggactttca 

ggattgctat 

ttggaatctt 

gacgttataa 

taagttcaat 

27180 

agtgagacaa 

ttatttaagc 

ctctaactaa 

gaagaattat 

tcggagttag 

atgatgaaga 

27240 
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acctatggag 

ttagattatc 

cataaaacga 

acatgaaaat 

tattetette 

ctgacattga 

27300 

ttgtatttac 

atcttgcgag 

ctatatcact 

atcaggagtg 

tgttagaggt 

acgactgtac 

27360 

tactaaaaga 

accttgccca 

tcaggaacat 

acgagggcaa 

ttcaccattt 

caccctcttg 

27420 

ctgacaataa 

atttgcacta 

acttgcacta 

gcacacactt 

tgettttget 

tgtgctgacg 

27480 

gtactcgaca 

tacctatcag 

ctgcgtgcaa 

gatcagtttc 

accaaaactt 

ttcatcagac 

27540 

aagaggaggt 

tcaacaagag 

ctctactcgc 

cactttttct 

cattgttgct 

getetagtat 

27600 

ttttaatact 

ttgcttcacc 

attaagagaa 

agacagaatg 

aatgagetea 

etttaattga 

27660 

cttctatttg 

tgctttttag 

cctttctgct 

attccttgtt 

ttaataatge 

ttattatatt 

27720 

ttggttttca 

ctcgaaatcc 

aggatctaga 

agaaccttgt 

accaaagtct 

aaacgaacat 

27780 

gaaacttctc 

attgttttga 

cttgtatttc 

tctatgcagt 

tgcatatgca 

ctgtagtaca 

27840 

gcgctgtgca 

tctaataaac 

ctcatgtgct 

tgaagatcct 

tgtaaggtac 

aacactaggg 

27900 

gtaatactta 

tagcactgct 

tggctttgtg 

ctctaggaaa 

ggttttacct 

tttcatagat 

27960 

ggcacactat 

ggttcaaaca 

tgcacaccta 

atgttactat 

caactgtcaa 

gatccagctg 

28020 

gtggtgcgct 

tatagctagg 

tgttggtacc 

ttcatgaagg 

tcaccaaact 

getgeattta 

28080 

gagacgtact 

tgttgtttta 

aataaacgaa 

caaattaaaa 

tgtctgataa 

tggaccccaa 

28140 

tcaaaccaac 

gtagtgcccc 

ccgcattaca 

tttggtggac 

ccacagattc 

aactgacaat 

28200 

aaccagaatg 

gaggacgcaa 

tggggeaagg 

ccaaaacagc 

gccgacccca 

aggtttaccc 

28260 

aataatactg 

cgtcttggtt 

cacagctctc 

actcagcatg 

gcaaggagga 

aettagatte 

28320 

cctcgaggcc 

agggcgttcc 

aatcaacacc 

aatagtggtc 

cagatgacca 

aattggctac 

28380 

taccgaagag 

ctacccgacg 

agttcgtggt 

ggtgacggca 

aaatgaaaga 

gctcagcccc 

28440 

agatggtact 

tctattacct 

aggaactggc 

ccagaagctt 

cacttcccta 

cggcgctaac 

28500 

aaagaaggca 

tcgtatgggt 

tgcaactgag 

ggagccttga 

atacacccaa 

agaccacatt 

28560 

ggcacccgca 

atcctaataa 

caatgctgcc 

accgtgctac 

aacttcctca 

aggaacaaca 

28620 

ttgccaaaag 

gcttctacgc 

agagggaagc 

agaggcggca 

gtcaagcctc 

ttctcgctcc 

28680 

tcatcacgta 

gtcgcggtaa 

ttcaagaaat 

tcaactcctg 

gcagcagtag 

gggaaattct 

28740 

cctgctcgaa 

tggctagcgg 

aggtggtgaa 

actgccctcg 

egetattget 

gctagacaga 

28800 

ttgaaccagc 

ttgagagcaa 

agtttctggt 

aaaggccaac 

aacaacaagg 

ccaaactgtc 

28860 

actaagaaat 

ctgctgctga 

ggcatctaaa 

aagcctcgcc 

aaaaacgtac 

tgccacaaaa 

28920 

cagtacaacg 

tcactcaagc 

atttgggaga 

cgtggtccag 

aacaaaccca 

aggaaatttc 

28980 

ggggaccaag 

acctaatcag 

acaaggaact 

gattacaaac 

attggccgca 

aattgcacaa 

29040 

tttgctccaa 

gtgcctctgc 

attctttgga 

atgtcacgca 

ttggcatgga 

agtcacacct 

29100 

tcgggaacat 

ggctgactta 

tcatggagcc 

attaaattgg 

atgacaaaga 

tccacaattc 

29160 

aaagacaacg 

tcatactgct 

gaacaagcac 

attgacgcat 

acaaaacatt 

cccaccaaca 

29220 

gagcctaaaa 

aggacaaaaa 

gaaaaagact 

gatgaagctc 

agcctttgcc 

gcagagacaa 

29280 

aagaagcagc 

ccactgtgac 

tcttcttcct 

gcggctgaca 

tggatgattt 

ctccagacaa 

29340 

cttcaaaatt 

ccatgagtgg 

agcttctgct 

gatteaactc 

aggeataaae 

actcatgatg 

29400 

accacacaag 

gcagatgggc 

tatgtaaacg 

ttttegeaat 

teegtttaeg 

atacatagtc 

29460 

tactcttgtg 

cagaatgaat 

tctcgtaact 

aaacagcaca 

agtaggttta 

gttaaettta 

29520 

atctcacata 

gcaatcttta 

atcaatgtgt 

aacattaggg 

aggaettgaa 

agagccacca 

29580 

cattttcatc 

gaggccacgc 

ggagtacgat 

egagggtaea 

gtgaataatg 

ctagggagag 

29640 
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ctgcctatat ggaagagccc taatgtgtaa aattaatttt agtagtgcta tccccatgtg 
attttaatag cttcttagga gaatgac 


<210> SEQ ID NO 2 

<211> LENGTH: 4382 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 2 


Met 

Glu 

Ser 

Leu 

Val 

Leu 

Gly 

Val 

Asn 

Glu 

Lys 

Thr 

His 

Val 

Gin 

Leu 

1 




5 





10 





15 


Ser 

Leu 

Pro 

Val 

Leu 

Gin 

Val 

Arg 

Asp 

Val 

Leu 

Val 

Arg 

Gly 

Phe 

Gly 




20 





25 





30 



Asp 

Ser 

Val 

Glu 

Glu 

Ala 

Leu 

Ser 

Glu 

Ala 

Arg 

Glu 

His 

Leu 

Lys 

Asn 



35 





40 





45 




Gly 

Thr 

Cys 

Gly 

Leu 

Val 

Glu 

Leu 

Glu 

Lys 

Gly 

Val 

Leu 

Pro 

Gin 

Leu 


50 





55 





60 





Glu 

Gin 

Pro 

Tyr 

Val 

Phe 

Ile 

Lys 

Arg 

Ser 

Asp 

Ala 

Leu 

Ser 

Thr 

Asn 

65 





70 





75 





80 

His 

Gly 

His 

Lys 

Val 

Val 

Glu 

Leu 

Val 

Ala 

Glu 

Met 

Asp 

Gly 

Ile 

Gin 





85 





90 





95 


Tyr 

Gly 

Arg 

Ser 

Gly 

Ile 

Thr 

Leu 

Gly 

Val 

Leu 

Val 

Pro 

His 

Val 

Gly 




100 





105 





110 



Glu 

Thr 

Pro 

Ile 

Ala 

Tyr 

Arg 

Asn 

Val 

Leu 

Leu 

Arg 

Lys 

Asn 

Gly 

Asn 



115 





120 





125 




Lys 

Gly 

Ala 

Gly 

Gly 

His 

Ser 

Tyr 

Gly 

Ile 

Asp 

Leu 

Lys 

Ser 

Tyr 

Asp 


130 





135 





140 





Leu 

Gly 

Asp 

Glu 

Leu 

Gly 

Thr 

Asp 

Pro 

Ile 

Glu 

Asp 

Tyr 

Glu 

Gin 

Asn 

145 





150 





155 





160 

Trp 

Asn 

Thr 

Lys 

His 

Gly 

Ser 

Gly 

Ala 

Leu 

Arg 

Glu 

Leu 

Thr 

Arg 

Glu 





165 





170 





175 


Leu 

Asn 

Gly 

Gly 

Ala 

Val 

Thr 

Arg 

Tyr 

Val 

Asp 

Asn 

Asn 

Phe 

Cys 

Gly 




180 





185 





190 



Pro 

Asp 

Gly 

Tyr 

Pro 

Leu 

Asp 

Cys 

Ile 

Lys 

Asp 

Phe 

Leu 

Ala 

Arg 

Ala 



195 





200 





205 




Gly 

Lys 

Ser 

Met 

Cys 

Thr 

Leu 

Ser 

Glu 

Gin 

Leu 

Asp 

Tyr 

Ile 

Glu 

Ser 


210 





215 





220 





Lys 

Arg 

Gly 

Val 

Tyr 

Cys 

Cys 

Arg 

Asp 

His 

Glu 

His 

Glu 

Ile 

Ala 

Trp 

225 





230 





235 





240 

Phe 

Thr 

Glu 

Arg 

Ser 

Asp 

Lys 

Ser 

Tyr 

Glu 

His 

Gin 

Thr 

Pro 

Phe 

Glu 





245 





250 





255 


Ile 

Lys 

Ser 

Ala 

Lys 

Lys 

Phe 

Asp 

Thr 

Phe 

Lys 

Gly 

Glu 

Cys 

Pro 

Lys 




260 





265 





270 



Phe 

Val 

Phe 

Pro 

Leu 

Asn 

Ser 

Lys 

Val 

Lys 

Val 

Ile 

Gin 

Pro 

Arg 

Val 



275 





280 





285 




Glu 

Lys 

Lys 

Lys 

Thr 

Glu 

Gly 

Phe 

Met 

Gly 

Arg 

Ile 

Arg 

Ser 

Val 

Tyr 


290 





295 





300 





Pro 

Val 

Ala 

Ser 

Pro 

Gin 

Glu 

Cys 

Asn 

Asn 

Met 

His 

Leu 

Ser 

Thr 

Leu 

305 





310 





315 





320 

Met 

Lys 

Cys 

Asn 

His 

Cys 

Asp 

Glu 

Val 

Ser 

Trp 

Gin 

Thr 

Cys 

Asp 

Phe 





325 





330 





335 


Leu 

Lys 

Ala 

Thr 

Cys 

Glu 

His 

Cys 

Gly 

Thr 

Glu 

Asn 

Leu 

Val 

Ile 

Glu 




340 





345 





350 




29700 

29727 
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Gly 

Pro 

Thr 

Thr 

Cys 

Gly 

Tyr 

Leu 

Pro 

Thr 

Asn 

Ala 

Val 

Val 

Lys 

Met 



355 





360 





365 




Pro 

Cys 

Pro 

Ala 

Cys 

Gin 

Asp 

Pro 

Glu 

Ile 

Gly 

Pro 

Glu 

His 

Ser 

Val 


370 





375 





380 





Ala 

Asp 

Tyr 

His 

Asn 

His 

Ser 

Asn 

Ile 

Glu 

Thr 

Arg 

Leu 

Arg 

Lys 

Gly 

385 





390 





395 





400 

Gly Arg 

Thr 

Arg 

Cys 

Phe 

Gly 

Gly 

Cys 

Val 

Phe 

Ala 

Tyr 

Val 

Gly 

Cys 





405 





410 





415 


Tyr 

Asn 

Lys 

Arg 

Ala 

Tyr 

Trp 

Val 

Pro 

Arg 

Ala 

Ser 

Ala 

Asp 

Ile 

Gly 




420 





425 





430 



Ser 

Gly 

His 

Thr 

Gly 

Ile 

Thr 

Gly 

Asp 

Asn 

Val 

Glu 

Thr 

Leu 

Asn 

Glu 



435 





440 





445 




Asp 

Leu 

Leu 

Glu 

Ile 

Leu 

Ser 

Arg 

Glu 

Arg 

Val 

Asn 

Ile 

Asn 

Ile 

Val 


450 





455 





460 





Gly Asp 

Phe 

His 

Leu 

Asn 

Glu 

Glu 

Val 

Ala 

Ile 

Ile 

Leu 

Ala 

Ser 

Phe 

465 





470 





475 





480 

Ser 

Ala 

Ser 

Thr 

Ser 

Ala 

Phe 

Ile 

Asp 

Thr 

Ile 

Lys 

Ser 

Leu 

Asp 

Tyr 





485 





490 





495 


Lys 

Ser 

Phe 

Lys 

Thr 

Ile 

Val 

Glu 

Ser 

Cys 

Gly 

Asn 

Tyr 

Lys 

Val 

Thr 




500 





505 





510 



Lys 

Gly 

Lys 

Pro 

Val 

Lys 

Gly 

Ala 

Trp 

Asn 

Ile 

Gly 

Gin 

Gin 

Arg 

Ser 



515 





520 





525 




Val 

Leu 

Thr 

Pro 

Leu 

Cys 

Gly 

Phe 

Pro 

Ser 

Gin 

Ala 

Ala 

Gly 

Val 

Ile 


530 





535 





540 





Arg 

Ser 

Ile 

Phe 

Ala 

Arg 

Thr 

Leu 

Asp 

Ala 

Ala 

Asn 

His 

Ser 

Ile 

Pro 

545 





550 





555 





560 

Asp 

Leu 

Gin 

Arg 

Ala 

Ala 

Val 

Thr 

Ile 

Leu 

Asp 

Gly 

Ile 

Ser 

Glu 

Gin 





565 





570 





575 


Ser 

Leu 

Arg 

Leu 

Val 

Asp 

Ala 

Met 

Val 

Tyr 

Thr 

Ser 

Asp 

Leu 

Leu 

Thr 




580 





585 





590 



Asn 

Ser 

Val 

Ile 

Ile 

Met 

Ala 

Tyr 

Val 

Thr 

Gly 

Gly 

Leu 

Val 

Gin 

Gin 



595 





600 





605 




Thr 

Ser 

Gin 

Trp 

Leu 

Ser 

Asn 

Leu 

Leu 

Gly 

Thr 

Thr 

Val 

Glu 

Lys 

Leu 


610 





615 





620 





Arg 

Pro 

Ile 

Phe 

Glu 

Trp 

Ile 

Glu 

Ala 

Lys 

Leu 

Ser 

Ala 

Gly 

Val 

Glu 

625 





630 





635 





640 

Phe 

Leu 

Lys 

Asp 

Ala 

Trp 

Glu 

Ile 

Leu 

Lys 

Phe 

Leu 

Ile 

Thr 

Gly 

Val 





645 





650 





655 


Phe 

Asp 

Ile 

Val 

Lys 

Gly 

Gin 

Ile 

Gin 

Val 

Ala 

Ser 

Asp 

Asn 

Ile 

Lys 




660 





665 





670 



Asp 

Cys 

Val 

Lys 

Cys 

Phe 

Ile 

Asp 

Val 

Val 

Asn 

Lys 

Ala 

Leu 

Glu 

Met 



675 





680 





685 




Cys 

Ile 

Asp 

Gin 

Val 

Thr 

Ile 

Ala 

Gly 

Ala 

Lys 

Leu 

Arg 

Ser 

Leu 

Asn 


690 





695 





700 





Leu 

Gly 

Glu 

Val 

Phe 

Ile 

Ala 

Gin 

Ser 

Lys 

Gly 

Leu 

Tyr 

Arg 

Gin 

Cys 

705 





710 





715 





720 

Ile 

Arg 

Gly 

Lys 

Glu 

Gin 

Leu 

Gin 

Leu 

Leu 

Met 

Pro 

Leu 

Lys 

Ala 

Pro 





725 





730 





735 


Lys 

Glu 

Val 

Thr 

Phe 

Leu 

Glu 

Gly 

Asp 

Ser 

His 

Asp 

Thr 

Val 

Leu 

Thr 




740 





745 





750 



Ser 

Glu 

Glu 

Val 

Val 

Leu 

Lys 

Asn 

Gly 

Glu 

Leu 

Glu 

Ala 

Leu 

Glu 

Thr 



755 





760 





765 




Pro 

Val 

Asp 

Ser 

Phe 

Thr 

Asn 

Gly 

Ala 

Ile 

Val 

Gly 

Thr 

Pro 

Val 

Cys 
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770 775 780 


Val 

Asn 

Gly 

Leu 

Met 

Leu 

Leu 

Glu 

Ile 

Lys 

Asp 

Lys 

Glu 

Gin 

Tyr 

Cys 

785 





790 





795 





800 

Ala 

Leu 

Ser 

Pro 

Gly 

Leu 

Leu 

Ala 

Thr 

Asn 

Asn 

Val 

Phe 

Arg 

Leu 

Lys 





805 





810 





815 


Gly 

Gly Ala 

Pro 

Ile 

Lys 

Gly 

Val 

Thr 

Phe 

Gly 

Glu 

Asp 

Thr 

Val 

Trp 




820 





825 





830 



Glu 

Val 

Gin 

Gly 

Tyr 

Lys 

Asn 

Val 

Arg 

Ile 

Thr 

Phe 

Glu 

Leu 

Asp 

Glu 



835 





840 





845 




Arg 

Val 

Asp 

Lys 

Val 

Leu 

Asn 

Glu 

Lys 

Cys 

Ser 

Val 

Tyr 

Thr 

Val 

Glu 


850 





855 





860 





Ser 

Gly 

Thr 

Glu 

Val 

Thr 

Glu 

Phe 

Ala 

Cys 

Val 

Val 

Ala 

Glu 

Ala 

Val 

865 





870 





875 





880 

Val 

Lys 

Thr 

Leu 

Gin 

Pro 

Val 

Ser 

Asp 

Leu 

Leu 

Thr 

Asn 

Met 

Gly 

Ile 





885 





890 





895 


Asp 

Leu 

Asp 

Glu 

Trp 

Ser 

Val 

Ala 

Thr 

Phe 

Tyr 

Leu 

Phe 

Asp 

Asp 

Ala 




900 





905 





910 



Gly Glu 

Glu 

Asn 

Phe 

Ser 

Ser 

Arg 

Met 

Tyr 

Cys 

Ser 

Phe 

Tyr 

Pro 

Pro 



915 





920 





925 




Asp 

Glu 

Glu 

Glu 

Glu 

Asp 

Asp 

Ala 

Glu 

Cys 

Glu 

Glu 

Glu 

Glu 

Ile 

Asp 


930 





935 





940 





Glu 

Thr 

Cys 

Glu 

His 

Glu 

Tyr 

Gly Thr 

Glu 

Asp 

Asp 

Tyr 

Gin 

Gly 

Leu 

945 





950 





955 





960 

Pro 

Leu 

Glu 

Phe 

Gly 

Ala 

Ser 

Ala 

Glu 

Thr 

Val 

Arg 

Val 

Glu 

Glu 

Glu 





965 





970 





975 


Glu 

Glu 

Glu 

Asp 

Trp 

Leu 

Asp 

Asp 

Thr 

Thr 

Glu 

Gin 

Ser 

Glu 

Ile 

Glu 




980 





985 





990 




Pro Glu Pro Glu Pro Thr Pro Glu Glu Pro Val Asn Gin Phe Thr Gly 
995 1000 1005 


Tyr 

Leu 

1010 

Lys 

Leu 

Thr 

Asp 

Asn 

1015 

Val 

Ala 

Ile 

Lys 

Cys 

1020 

Val 

Asp 

Ile 

Val 

Lys 

1025 

Glu 

Ala 

Gin 

Ser 

Ala 

1030 

Asn 

Pro 

Met 

Val 

Ile 

1035 

Val 

Asn 

Ala 

Ala 

Asn 

1040 

Ile 

His 

Leu 

Lys 

His 

1045 

Gly 

Gly 

Gly 

Val 

Ala 

1050 

Gly 

Ala 

Leu 

Asn 

Lys 

1055 

Ala 

Thr 

Asn 

Gly 

Ala 

1060 

Met 

Gin 

Lys 

Glu 

Ser 

1065 

Asp 

Asp 

Tyr 

Ile 

Lys 

1070 

Leu 

Asn 

Gly 

Pro 

Leu 

1075 

Thr 

Val 

Gly 

Gly 

Ser 

1080 

Cys 

Leu 

Leu 

Ser 

Gly 

1085 

His 

Asn 

Leu 

Ala 

Lys 

1090 

Lys 

Cys 

Leu 

His 

Val 

1095 

Val 

Gly 

Pro 

Asn 

Leu 

1100 

Asn 

Ala 

Gly 

Glu 

Asp 

1105 

Ile 

Gin 

Leu 

Leu 

Lys 

1110 

Ala 

Ala 

Tyr 

Glu 

Asn 

1115 

Phe 

Asn 

Ser 

Gin 

Asp 

1120 

Ile 

Leu 

Leu 

Ala 

Pro 

1125 

Leu 

Leu 

Ser 

Ala 

Gly 

1130 

Ile 

Phe 

Gly 

Ala 

Lys 

1135 

Pro 

Leu 

Gin 

Ser 

Leu 

1140 

Gin 

Val 

Cys 

Val 

Gin 

1145 

Thr 

Val 

Arg 

Thr 

Gin 

1150 

Val 

Tyr 

Ile 

Ala 

Val 

1155 

Asn 

Asp 

Lys 

Ala 

Leu 

1160 

Tyr 

Glu 

Gin 

Val 

Val 

1165 

Met 

Asp 

Tyr 

Leu 

Asp 

1170 

Asn 

Leu 

Lys 

Pro 

Arg 

1175 

Val 

Glu 

Ala 

Pro 

Lys 

1180 

Gin 

Glu 

Glu 

Pro 

Pro 

1185 

Asn 

Thr 

Glu 
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Asp 

Ser 

1190 

Lys 

Thr 

Glu 

Glu 

Lys 

1195 

Ser 

Val 

Val 

Gin 

Lys 

1200 

Pro 

Val 

Asp 

Val 

Lys 

1205 

Pro 

Lys 

Ile 

Lys 

Ala 

1210 

Cys 

Ile 

Asp 

Glu 

Val 

1215 

Thr 

Thr 

Thr 

Leu 

Glu 

1220 

Glu 

Thr 

Lys 

Phe 

Leu 

1225 

Thr 

Asn 

Lys 

Leu 

Leu 

1230 

Leu 

Phe 

Ala 

Asp 

Ile 

1235 

Asn 

Gly 

Lys 

Leu 

Tyr 

1240 

His 

Asp 

Ser 

Gin 

Asn 

1245 

Met 

Leu 

Arg 

Gly 

Glu 

1250 

Asp 

Met 

Ser 

Phe 

Leu 

1255 

Glu 

Lys 

Asp 

Ala 

Pro 

1260 

Tyr 

Met 

Val 

Gly Asp 
1265 

Val 

Ile 

Thr 

Ser 

Gly 

1270 

Asp 

Ile 

Thr 

Cys 

Val 

1275 

Val 

Ile 

Pro 

Ser 

Lys 

1280 

Lys 

Ala 

Gly 

Gly 

Thr 

1285 

Thr 

Glu 

Met 

Leu 

Ser 

1290 

Arg 

Ala 

Leu 

Lys 

Lys 

1295 

Val 

Pro 

Val 

Asp 

Glu 

1300 

Tyr 

Ile 

Thr 

Thr 

Tyr 

1305 

Pro 

Gly Gin 

Gly 

Cys 

1310 

Ala 

Gly Tyr 

Thr 

Leu 

1315 

Glu 

Glu 

Ala 

Lys 

Thr 

1320 

Ala 

Leu 

Lys 

Lys 

Cys 

1325 

Lys 

Ser 

Ala 

Phe 

Tyr 

1330 

Val 

Leu 

Pro 

Ser 

Glu 

1335 

Ala 

Pro 

Asn 

Ala 

Lys 

1340 

Glu 

Glu 

Ile 

Leu 

Gly 

1345 

Thr 

Val 

Ser 

Trp 

Asn 

1350 

Leu 

Arg 

Glu 

Met 

Leu 

1355 

Ala 

His 

Ala 

Glu 

Glu 

1360 

Thr 

Arg 

Lys 

Leu 

Met 

1365 

Pro 

Ile 

Cys 

Met 

Asp 

1370 

Val 

Arg 

Ala 

Ile 

Met 

1375 

Ala 

Thr 

Ile 

Gin 

Arg 

1380 

Lys 

Tyr 

Lys 

Gly 

Ile 

1385 

Lys 

Ile 

Gin 

Glu 

Gly 

1390 

Ile 

Val 

Asp 

Tyr 

Gly 

1395 

Val 

Arg 

Phe 

Phe 

Phe 

1400 

Tyr 

Thr 

Ser 

Lys 

Glu 

1405 

Pro 

Val 

Ala 

Ser 

Ile 

1410 

Ile 

Thr 

Lys 

Leu 

Asn 

1415 

Ser 

Leu 

Asn 

Glu 

Pro 

1420 

Leu 

Val 

Thr 

Met 

Pro 

1425 

Ile 

Gly Tyr 

Val 

Thr 

1430 

His 

Gly Phe 

Asn 

Leu 

1435 

Glu 

Glu 

Ala 

Ala 

Arg 

1440 

Cys 

Met 

Arg 

Ser 

Leu 

1445 

Lys 

Ala 

Pro 

Ala 

Val 

1450 

Val 

Ser 

Val 

Ser 

Ser 

1455 

Pro 

Asp 

Ala 

Val 

Thr 

1460 

Thr 

Tyr 

Asn 

Gly 

Tyr 

1465 

Leu 

Thr 

Ser 

Ser 

Ser 

1470 

Lys 

Thr 

Ser 

Glu 

Glu 

1475 

His 

Phe 

Val 

Glu 

Thr 

1480 

Val 

Ser 

Leu 

Ala 

Gly 

1485 

Ser 

Tyr 

Arg 

Asp 

Trp 

1490 

Ser 

Tyr 

Ser 

Gly 

Gin 

1495 

Arg 

Thr 

Glu 

Leu 

Gly 

1500 

Val 

Glu 

Phe 

Leu 

Lys 

1505 

Arg 

Gly Asp 

Lys 

Ile 

1510 

Val 

Tyr 

His 

Thr 

Leu 

1515 

Glu 

Ser 

Pro 

Val 

Glu 

1520 

Phe 

His 

Leu 

Asp 

Gly 

1525 

Glu 

Val 

Leu 

Ser 

Leu 

1530 

Asp 

Lys 

Leu 

Lys 

Ser 

1535 

Leu 

Leu 

Ser 

Leu 

Arg 

1540 

Glu 

Val 

Lys 

Thr 

Ile 

1545 

Lys 

Val 

Phe 

Thr 

Thr 

1550 

Val 

Asp 

Asn 

Thr 

Asn 

1555 

Leu 

His 

Thr 

Gin 

Leu 

1560 

Val 

Asp 

Met 

Ser 

Met 

1565 

Thr 

Tyr 

Gly 

Gin 

Gin 

1570 

Phe 

Gly 

Pro 

Thr 

Tyr 

1575 

Leu 

Asp 

Gly 
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Ala 

Asp 

1580 

Val 

Thr 

Lys 

Ile 

Lys 

1585 

Pro 

His 

Val 

Asn 

His 

1590 

Glu 

Gly Lys 

Thr 

Phe 

1595 

Phe 

Val 

Leu 

Pro 

Ser 

1600 

Asp 

Asp 

Thr 

Leu 

Arg 

1605 

Ser 

Glu 

Ala 

Phe 

Glu 

1610 

Tyr 

Tyr 

His 

Thr 

Leu 

1615 

Asp 

Glu 

Ser 

Phe 

Leu 

1620 

Gly 

Arg 

Tyr 

Met 

Ser 

1625 

Ala 

Leu 

Asn 

His 

Thr 

1630 

Lys 

Lys 

Trp 

Lys 

Phe 

1635 

Pro 

Gin 

Val 

Gly 

Gly 

1640 

Leu 

Thr 

Ser 

Ile 

Lys 

1645 

Trp 

Ala 

Asp 

Asn 

Asn 

1650 

Cys 

Tyr 

Leu 

Ser 

Ser 

1655 

Val 

Leu 

Leu 

Ala 

Leu 

1660 

Gin 

Gin 

Leu 

Glu 

Val 

1665 

Lys 

Phe 

Asn 

Ala 

Pro 

1670 

Ala 

Leu 

Gin 

Glu 

Ala 

1675 

Tyr 

Tyr 

Arg 

Ala 

Arg 

1680 

Ala 

Gly Asp 

Ala 

Ala 

1685 

Asn 

Phe 

Cys 

Ala 

Leu 

1690 

Ile 

Leu 

Ala 

Tyr 

Ser 

1695 

Asn 

Lys 

Thr 

Val 

Gly 

1700 

Glu 

Leu 

Gly 

Asp 

Val 

1705 

Arg 

Glu 

Thr 

Met 

Thr 

1710 

His 

Leu 

Leu 

Gin 

His 

1715 

Ala 

Asn 

Leu 

Glu 

Ser 

1720 

Ala 

Lys 

Arg 

Val 

Leu 

1725 

Asn 

Val 

Val 

Cys 

Lys 

1730 

His 

Cys 

Gly 

Gin 

Lys 

1735 

Thr 

Thr 

Thr 

Leu 

Thr 

1740 

Gly 

Val 

Glu 

Ala 

Val 

1745 

Met 

Tyr 

Met 

Gly 

Thr 

1750 

Leu 

Ser 

Tyr 

Asp 

Asn 

1755 

Leu 

Lys 

Thr 

Gly 

Val 

1760 

Ser 

Ile 

Pro 

Cys 

Val 

1765 

Cys 

Gly Arg 

Asp 

Ala 

1770 

Thr 

Gin 

Tyr 

Leu 

Val 

1775 

Gin 

Gin 

Glu 

Ser 

Ser 

1780 

Phe 

Val 

Met 

Met 

Ser 

1785 

Ala 

Pro 

Pro 

Ala 

Glu 

1790 

Tyr 

Lys 

Leu 

Gin 

Gin 

1795 

Gly 

Thr 

Phe 

Leu 

Cys 

1800 

Ala 

Asn 

Glu 

Tyr 

Thr 

1805 

Gly 

Asn 

Tyr 

Gin 

Cys 

1810 

Gly 

His 

Tyr 

Thr 

His 

1815 

Ile 

Thr 

Ala 

Lys 

Glu 

1820 

Thr 

Leu 

Tyr 

Arg 

Ile 

1825 

Asp 

Gly Ala 

His 

Leu 

1830 

Thr 

Lys 

Met 

Ser 

Glu 

1835 

Tyr 

Lys 

Gly 

Pro 

Val 

1840 

Thr 

Asp 

Val 

Phe 

Tyr 

1845 

Lys 

Glu 

Thr 

Ser 

Tyr 

1850 

Thr 

Thr 

Thr 

Ile 

Lys 

1855 

Pro 

Val 

Ser 

Tyr 

Lys 

1860 

Leu 

Asp 

Gly 

Val 

Thr 

1865 

Tyr 

Thr 

Glu 

Ile 

Glu 

1870 

Pro 

Lys 

Leu 

Asp 

Gly 

1875 

Tyr 

Tyr 

Lys 

Lys 

Asp 

1880 

Asn 

Ala 

Tyr 

Tyr 

Thr 

1885 

Glu 

Gin 

Pro 

Ile 

Asp 

1890 

Leu 

Val 

Pro 

Thr 

Gin 

1895 

Pro 

Leu 

Pro 

Asn 

Ala 

1900 

Ser 

Phe 

Asp 

Asn 

Phe 

1905 

Lys 

Leu 

Thr 

Cys 

Ser 

1910 

Asn 

Thr 

Lys 

Phe 

Ala 

1915 

Asp 

Asp 

Leu 

Asn 

Gin 

1920 

Met 

Thr 

Gly 

Phe 

Thr 

1925 

Lys 

Pro 

Ala 

Ser 

Arg 

1930 

Glu 

Leu 

Ser 

Val 

Thr 

1935 

Phe 

Phe 

Pro 

Asp 

Leu 

1940 

Asn 

Gly Asp 

Val 

Val 

1945 

Ala 

Ile 

Asp 

Tyr 

Arg 

1950 

His 

Tyr 

Ser 

Ala 

Ser 

1955 

Phe 

Lys 

Lys 

Gly 

Ala 

1960 

Lys 

Leu 

Leu 

His 

Lys 

1965 

Pro 

Ile 

Val 

Trp 

His 

Ile 

Asn 

Gin 

Ala 

Thr 

Thr 

Lys 

Thr 

Thr 

Phe 

Lys 

Pro 

Asn 
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1970 





1975 





1980 




Thr 

Trp 

1985 

Cys 

Leu 

Arg 

Cys 

Leu 

1990 

Trp 

Ser 

Thr 

Lys 

Pro 

1995 

Val 

Asp 

Thr 

Ser 

Asn 

2000 

Ser 

Phe 

Glu 

Val 

Leu 

2005 

Ala 

Val 

Glu 

Asp 

Thr 

2010 

Gin 

Gly Met 

Asp 

Asn 

2015 

Leu 

Ala 

Cys 

Glu 

Ser 

2020 

Gin 

Gin 

Pro 

Thr 

Ser 

2025 

Glu 

Glu 

Val 

Val 

Glu 

2030 

Asn 

Pro 

Thr 

Ile 

Gin 

2035 

Lys 

Glu 

Val 

Ile 

Glu 

2040 

Cys 

Asp 

Val 

Lys 

Thr 

2045 

Thr 

Glu 

Val 

Val 

Gly 

2050 

Asn 

Val 

Ile 

Leu 

Lys 

2055 

Pro 

Ser 

Asp 

Glu 

Gly 

2060 

Val 

Lys 

Val 

Thr 

Gin 

2065 

Glu 

Leu 

Gly 

His 

Glu 

2070 

Asp 

Leu 

Met 

Ala 

Ala 

2075 

Tyr 

Val 

Glu 

Asn 

Thr 

2080 

Ser 

Ile 

Thr 

Ile 

Lys 

2085 

Lys 

Pro 

Asn 

Glu 

Leu 

2090 

Ser 

Leu 

Ala 

Leu 

Gly 

2095 

Leu 

Lys 

Thr 

Ile 

Ala 

2100 

Thr 

His 

Gly 

Ile 

Ala 

2105 

Ala 

Ile 

Asn 

Ser 

Val 

2110 

Pro 

Trp 

Ser 

Lys 

Ile 

2115 

Leu 

Ala 

Tyr 

Val 

Lys 

2120 

Pro 

Phe 

Leu 

Gly 

Gin 

2125 

Ala 

Ala 

Ile 

Thr 

Thr 

2130 

Ser 

Asn 

Cys 

Ala 

Lys 

2135 

Arg 

Leu 

Ala 

Gin 

Arg 

2140 

Val 

Phe 

Asn 

Asn 

Tyr 

2145 

Met 

Pro 

Tyr 

Val 

Phe 

2150 

Thr 

Leu 

Leu 

Phe 

Gin 

2155 

Leu 

Cys 

Thr 

Phe 

Thr 

2160 

Lys 

Ser 

Thr 

Asn 

Ser 

2165 

Arg 

Ile 

Arg 

Ala 

Ser 

2170 

Leu 

Pro 

Thr 

Thr 

Ile 

2175 

Ala 

Lys 

Asn 

Ser 

Val 

2180 

Lys 

Ser 

Val 

Ala 

Lys 

2185 

Leu 

Cys 

Leu 

Asp 

Ala 

2190 

Gly 

Ile 

Asn 

Tyr 

Val 

2195 

Lys 

Ser 

Pro 

Lys 

Phe 

2200 

Ser 

Lys 

Leu 

Phe 

Thr 

2205 

Ile 

Ala 

Met 

Trp 

Leu 

2210 

Leu 

Leu 

Leu 

Ser 

Ile 

2215 

Cys 

Leu 

Gly 

Ser 

Leu 

2220 

Ile 

Cys 

Val 

Thr 

Ala 

2225 

Ala 

Phe 

Gly 

Val 

Leu 

2230 

Leu 

Ser 

Asn 

Phe 

Gly 

2235 

Ala 

Pro 

Ser 

Tyr 

Cys 

2240 

Asn 

Gly Val 

Arg 

Glu 

2245 

Leu 

Tyr 

Leu 

Asn 

Ser 

2250 

Ser 

Asn 

Val 

Thr 

Thr 

2255 

Met 

Asp 

Phe 

Cys 

Glu 

2260 

Gly 

Ser 

Phe 

Pro 

Cys 

2265 

Ser 

Ile 

Cys 

Leu 

Ser 

2270 

Gly 

Leu 

Asp 

Ser 

Leu 

2275 

Asp 

Ser 

Tyr 

Pro 

Ala 

2280 

Leu 

Glu 

Thr 

Ile 

Gin 

2285 

Val 

Thr 

Ile 

Ser 

Ser 

2290 

Tyr 

Lys 

Leu 

Asp 

Leu 

2295 

Thr 

Ile 

Leu 

Gly 

Leu 

2300 

Ala 

Ala 

Glu 

Trp 

Val 

2305 

Leu 

Ala 

Tyr 

Met 

Leu 

2310 

Phe 

Thr 

Lys 

Phe 

Phe 

2315 

Tyr 

Leu 

Leu 

Gly 

Leu 

2320 

Ser 

Ala 

Ile 

Met 

Gin 

2325 

Val 

Phe 

Phe 

Gly 

Tyr 

2330 

Phe 

Ala 

Ser 

His 

Phe 

2335 

Ile 

Ser 

Asn 

Ser 

Trp 

2340 

Leu 

Met 

Trp 

Phe 

Ile 

2345 

Ile 

Ser 

Ile 

Val 

Gin 

2350 

Met 

Ala 

Pro 

Val 

Ser 

2355 

Ala 

Met 

Val 

Arg 

Met 

2360 

Tyr 

Ile 

Phe 

Phe 

Ala 

2365 

Ser 

Phe 

Tyr 

Tyr 

Ile 

2370 

Trp 

Lys 

Ser 
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Tyr 

Val 

2375 

His 

Ile 

Met 

Asp 

Gly 

2380 

Cys 

Thr 

Ser 

Ser 

Thr 

2385 

Cys 

Met 

Met 

Cys 

Tyr 

2390 

Lys 

Arg 

Asn 

Arg 

Ala 

2395 

Thr 

Arg 

Val 

Glu 

Cys 

2400 

Thr 

Thr 

Ile 

Val 

Asn 

2405 

Gly 

Met 

Lys 

Arg 

Ser 

2410 

Phe 

Tyr 

Val 

Tyr 

Ala 

2415 

Asn 

Gly Gly 

Arg 

Gly 

2420 

Phe 

Cys 

Lys 

Thr 

His 

2425 

Asn 

Trp 

Asn 

Cys 

Leu 

2430 

Asn 

Cys 

Asp 

Thr 

Phe 

2435 

Cys 

Thr 

Gly 

Ser 

Thr 

2440 

Phe 

Ile 

Ser 

Asp 

Glu 

2445 

Val 

Ala 

Arg 

Asp 

Leu 

2450 

Ser 

Leu 

Gin 

Phe 

Lys 

2455 

Arg 

Pro 

Ile 

Asn 

Pro 

2460 

Thr 

Asp 

Gin 

Ser 

Ser 

2465 

Tyr 

Ile 

Val 

Asp 

Ser 

2470 

Val 

Ala 

Val 

Lys 

Asn 

2475 

Gly 

Ala 

Leu 

His 

Leu 

2480 

Tyr 

Phe 

Asp 

Lys 

Ala 

2485 

Gly 

Gin 

Lys 

Thr 

Tyr 

2490 

Glu 

Arg 

His 

Pro 

Leu 

2495 

Ser 

His 

Phe 

Val 

Asn 

2500 

Leu 

Asp 

Asn 

Leu 

Arg 

2505 

Ala 

Asn 

Asn 

Thr 

Lys 

2510 

Gly 

Ser 

Leu 

Pro 

Ile 

2515 

Asn 

Val 

Ile 

Val 

Phe 

2520 

Asp 

Gly Lys 

Ser 

Lys 

2525 

Cys 

Asp 

Glu 

Ser 

Ala 

2530 

Ser 

Lys 

Ser 

Ala 

Ser 

2535 

Val 

Tyr 

Tyr 

Ser 

Gin 

2540 

Leu 

Met 

Cys 

Gin 

Pro 

2545 

Ile 

Leu 

Leu 

Leu 

Asp 

2550 

Gin 

Val 

Leu 

Val 

Ser 

2555 

Asp 

Val 

Gly 

Asp 

Ser 

2560 

Thr 

Glu 

Val 

Ser 

Val 

2565 

Lys 

Met 

Phe 

Asp 

Ala 

2570 

Tyr 

Val 

Asp 

Thr 

Phe 

2575 

Ser 

Ala 

Thr 

Phe 

Ser 

2580 

Val 

Pro 

Met 

Glu 

Lys 

2585 

Leu 

Lys 

Ala 

Leu 

Val 

2590 

Ala 

Thr 

Ala 

His 

Ser 

2595 

Glu 

Leu 

Ala 

Lys 

Gly 

2600 

Val 

Ala 

Leu 

Asp 

Gly 

2605 

Val 

Leu 

Ser 

Thr 

Phe 

2610 

Val 

Ser 

Ala 

Ala 

Arg 

2615 

Gin 

Gly Val 

Val 

Asp 

2620 

Thr 

Asp 

Val 

Asp 

Thr 

2625 

Lys 

Asp 

Val 

Ile 

Glu 

2630 

Cys 

Leu 

Lys 

Leu 

Ser 

2635 

His 

His 

Ser 

Asp 

Leu 

2640 

Glu 

Val 

Thr 

Gly 

Asp 

2645 

Ser 

Cys 

Asn 

Asn 

Phe 

2650 

Met 

Leu 

Thr 

Tyr 

Asn 

2655 

Lys 

Val 

Glu 

Asn 

Met 

2660 

Thr 

Pro 

Arg 

Asp 

Leu 

2665 

Gly 

Ala 

Cys 

Ile 

Asp 

2670 

Cys 

Asn 

Ala 

Arg 

His 

2675 

Ile 

Asn 

Ala 

Gin 

Val 

2680 

Ala 

Lys 

Ser 

His 

Asn 

2685 

Val 

Ser 

Leu 

Ile 

Trp 

2690 

Asn 

Val 

Lys 

Asp 

Tyr 

2695 

Met 

Ser 

Leu 

Ser 

Glu 

2700 

Gin 

Leu 

Arg 

Lys 

Gin 

2705 

Ile 

Arg 

Ser 

Ala 

Ala 

2710 

Lys 

Lys 

Asn 

Asn 

Ile 

2715 

Pro 

Phe 

Arg 

Leu 

Thr 

2720 

Cys 

Ala 

Thr 

Thr 

Arg 

2725 

Gin 

Val 

Val 

Asn 

Val 

2730 

Ile 

Thr 

Thr 

Lys 

Ile 

2735 

Ser 

Leu 

Lys 

Gly 

Gly 

2740 

Lys 

Ile 

Val 

Ser 

Thr 

2745 

Cys 

Phe 

Lys 

Leu 

Met 

2750 

Leu 

Lys 

Ala 

Thr 

Leu 

2755 

Leu 

Cys 

Val 

Leu 

Ala 

2760 

Ala 

Leu 

Val 
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Cys 

Tyr 

2765 

Ile 

Val 

Met 

Pro 

Val 

2770 

His 

Thr 

Leu 

Ser 

Ile 

2775 

His 

Asp 

Gly 

Tyr 

Thr 

2780 

Asn 

Glu 

Ile 

Ile 

Gly 

2785 

Tyr 

Lys 

Ala 

Ile 

Gin 

2790 

Asp 

Gly Val 

Thr 

Arg 

2795 

Asp 

Ile 

Ile 

Ser 

Thr 

2800 

Asp 

Asp 

Cys 

Phe 

Ala 

2805 

Asn 

Lys 

His 

Ala 

Gly 

2810 

Phe 

Asp 

Ala 

Trp 

Phe 

2815 

Ser 

Gin 

Arg 

Gly Gly 
2820 

Ser 

Tyr 

Lys 

Asn 

Asp 

2825 

Lys 

Ser 

Cys 

Pro 

Val 

2830 

Val 

Ala 

Ala 

Ile 

Ile 

2835 

Thr 

Arg 

Glu 

Ile 

Gly 

2840 

Phe 

Ile 

Val 

Pro 

Gly 

2845 

Leu 

Pro 

Gly 

Thr 

Val 

2850 

Leu 

Arg 

Ala 

Ile 

Asn 

2855 

Gly Asp 

Phe 

Leu 

His 

2860 

Phe 

Leu 

Pro 

Arg 

Val 

2865 

Phe 

Ser 

Ala 

Val 

Gly 

2870 

Asn 

Ile 

Cys 

Tyr 

Thr 

2875 

Pro 

Ser 

Lys 

Leu 

Ile 

2880 

Glu 

Tyr 

Ser 

Asp 

Phe 

2885 

Ala 

Thr 

Ser 

Ala 

Cys 

2890 

Val 

Leu 

Ala 

Ala 

Glu 

2895 

Cys 

Thr 

Ile 

Phe 

Lys 

2900 

Asp 

Ala 

Met 

Gly Lys 
2905 

Pro 

Val 

Pro 

Tyr 

Cys 

2910 

Tyr Asp 

Thr 

Asn 

Leu 

2915 

Leu 

Glu 

Gly 

Ser 

Ile 

2920 

Ser 

Tyr 

Ser 

Glu 

Leu 

2925 

Arg 

Pro 

Asp 

Thr 

Arg 

2930 

Tyr 

Val 

Leu 

Met 

Asp 

2935 

Gly 

Ser 

Ile 

Ile 

Gin 

2940 

Phe 

Pro 

Asn 

Thr 

Tyr 

2945 

Leu 

Glu 

Gly 

Ser 

Val 

2950 

Arg 

Val 

Val 

Thr 

Thr 

2955 

Phe 

Asp 

Ala 

Glu 

Tyr 

2960 

Cys 

Arg 

His 

Gly Thr 
2965 

Cys 

Glu 

Arg 

Ser 

Glu 

2970 

Val 

Gly 

Ile 

Cys 

Leu 

2975 

Ser 

Thr 

Ser 

Gly Arg 
2980 

Trp 

Val 

Leu 

Asn 

Asn 

2985 

Glu 

His 

Tyr 

Arg 

Ala 

2990 

Leu 

Ser 

Gly Val 

Phe 

2995 

Cys 

Gly Val 

Asp 

Ala 

3000 

Met 

Asn 

Leu 

Ile 

Ala 

3005 

Asn 

Ile 

Phe 

Thr 

Pro 

3010 

Leu 

Val 

Gin 

Pro 

Val 

3015 

Gly Ala 

Leu 

Asp 

Val 

3020 

Ser 

Ala 

Ser 

Val 

Val 

3025 

Ala 

Gly 

Gly 

Ile 

Ile 

3030 

Ala 

Ile 

Leu 

Val 

Thr 

3035 

Cys 

Ala 

Ala 

Tyr 

Tyr 

3040 

Phe 

Met 

Lys 

Phe 

Arg 

3045 

Arg 

Val 

Phe 

Gly 

Glu 

3050 

Tyr 

Asn 

His 

Val 

Val 

3055 

Ala 

Ala 

Asn 

Ala 

Leu 

3060 

Leu 

Phe 

Leu 

Met 

Ser 

3065 

Phe 

Thr 

Ile 

Leu 

Cys 

3070 

Leu 

Val 

Pro 

Ala 

Tyr 

3075 

Ser 

Phe 

Leu 

Pro 

Gly 

3080 

Val 

Tyr 

Ser 

Val 

Phe 

3085 

Tyr 

Leu 

Tyr 

Leu 

Thr 

3090 

Phe 

Tyr 

Phe 

Thr 

Asn 

3095 

Asp 

Val 

Ser 

Phe 

Leu 

3100 

Ala 

His 

Leu 

Gin 

Trp 

3105 

Phe 

Ala 

Met 

Phe 

Ser 

3110 

Pro 

Ile 

Val 

Pro 

Phe 

3115 

Trp 

Ile 

Thr 

Ala 

Ile 

3120 

Tyr 

Val 

Phe 

Cys 

Ile 

3125 

Ser 

Leu 

Lys 

His 

Cys 

3130 

His 

Trp 

Phe 

Phe 

Asn 

3135 

Asn 

Tyr 

Leu 

Arg 

Lys 

3140 

Arg 

Val 

Met 

Phe 

Asn 

3145 

Gly Val 

Thr 

Phe 

Ser 

3150 

Thr 

Phe 

Glu 

Glu 

Ala 

Ala 

Leu 

Cys 

Thr 

Phe 

Leu 

Leu 

Asn 

Lys 

Glu 

Met 

Tyr 

Leu 
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3155 





3160 





3165 




Lys 

Leu 

3170 

Arg 

Ser 

Glu 

Thr 

Leu 

3175 

Leu 

Pro 

Leu 

Thr 

Gin 

3180 

Tyr 

Asn 

Arg 

Tyr 

Leu 

3185 

Ala 

Leu 

Tyr 

Asn 

Lys 

3190 

Tyr 

Lys 

Tyr 

Phe 

Ser 

3195 

Gly 

Ala 

Leu 

Asp 

Thr 

3200 

Thr 

Ser 

Tyr 

Arg 

Glu 

3205 

Ala 

Ala 

Cys 

Cys 

His 

3210 

Leu 

Ala 

Lys 

Ala 

Leu 

3215 

Asn 

Asp 

Phe 

Ser 

Asn 

3220 

Ser 

Gly Ala 

Asp 

Val 

3225 

Leu 

Tyr 

Gin 

Pro 

Pro 

3230 

Gin 

Thr 

Ser 

Ile 

Thr 

3235 

Ser 

Ala 

Val 

Leu 

Gin 

3240 

Ser 

Gly Phe 

Arg 

Lys 

3245 

Met 

Ala 

Phe 

Pro 

Ser 

3250 

Gly 

Lys 

Val 

Glu 

Gly 

3255 

Cys 

Met 

Val 

Gin 

Val 

3260 

Thr 

Cys 

Gly 

Thr 

Thr 

3265 

Thr 

Leu 

Asn 

Gly 

Leu 

3270 

Trp 

Leu 

Asp 

Asp 

Thr 

3275 

Val 

Tyr 

Cys 

Pro 

Arg 

3280 

His 

Val 

Ile 

Cys 

Thr 

3285 

Ala 

Glu 

Asp 

Met 

Leu 

3290 

Asn 

Pro 

Asn 

Tyr 

Glu 

3295 

Asp 

Leu 

Leu 

Ile 

Arg 

3300 

Lys 

Ser 

Asn 

His 

Ser 

3305 

Phe 

Leu 

Val 

Gin 

Ala 

3310 

Gly 

Asn 

Val 

Gin 

Leu 

3315 

Arg 

Val 

Ile 

Gly 

His 

3320 

Ser 

Met 

Gin 

Asn 

Cys 

3325 

Leu 

Leu 

Arg 

Leu 

Lys 

3330 

Val 

Asp 

Thr 

Ser 

Asn 

3335 

Pro 

Lys 

Thr 

Pro 

Lys 

3340 

Tyr 

Lys 

Phe 

Val 

Arg 

3345 

Ile 

Gin 

Pro 

Gly 

Gin 

3350 

Thr 

Phe 

Ser 

Val 

Leu 

3355 

Ala 

Cys 

Tyr 

Asn 

Gly 

3360 

Ser 

Pro 

Ser 

Gly 

Val 

3365 

Tyr 

Gin 

Cys 

Ala 

Met 

3370 

Arg 

Pro 

Asn 

His 

Thr 

3375 

Ile 

Lys 

Gly 

Ser 

Phe 

3380 

Leu 

Asn 

Gly 

Ser 

Cys 

3385 

Gly 

Ser 

Val 

Gly 

Phe 

3390 

Asn 

Ile 

Asp 

Tyr 

Asp 

3395 

Cys 

Val 

Ser 

Phe 

Cys 

3400 

Tyr 

Met 

His 

His 

Met 

3405 

Glu 

Leu 

Pro 

Thr 

Gly 

3410 

Val 

His 

Ala 

Gly 

Thr 

3415 

Asp 

Leu 

Glu 

Gly 

Lys 

3420 

Phe 

Tyr 

Gly 

Pro 

Phe 

3425 

Val 

Asp 

Arg 

Gin 

Thr 

3430 

Ala 

Gin 

Ala 

Ala 

Gly 

3435 

Thr 

Asp 

Thr 

Thr 

Ile 

3440 

Thr 

Leu 

Asn 

Val 

Leu 

3445 

Ala 

Trp 

Leu 

Tyr 

Ala 

3450 

Ala 

Val 

Ile 

Asn 

Gly 

3455 

Asp 

Arg 

Trp 

Phe 

Leu 

3460 

Asn 

Arg 

Phe 

Thr 

Thr 

3465 

Thr 

Leu 

Asn 

Asp 

Phe 

3470 

Asn 

Leu 

Val 

Ala 

Met 

3475 

Lys 

Tyr 

Asn 

Tyr 

Glu 

3480 

Pro 

Leu 

Thr 

Gin 

Asp 

3485 

His 

Val 

Asp 

Ile 

Leu 

3490 

Gly 

Pro 

Leu 

Ser 

Ala 

3495 

Gin 

Thr 

Gly 

Ile 

Ala 

3500 

Val 

Leu 

Asp 

Met 

Cys 

3505 

Ala 

Ala 

Leu 

Lys 

Glu 

3510 

Leu 

Leu 

Gin 

Asn 

Gly 

3515 

Met 

Asn 

Gly 

Arg 

Thr 

3520 

Ile 

Leu 

Gly 

Ser 

Thr 

3525 

Ile 

Leu 

Glu 

Asp 

Glu 

3530 

Phe 

Thr 

Pro 

Phe 

Asp 

3535 

Val 

Val 

Arg 

Gin 

Cys 

3540 

Ser 

Gly Val 

Thr 

Phe 

3545 

Gin 

Gly 

Lys 

Phe 

Lys 

3550 

Lys 

Ile 

Val 

Lys 

Gly 

3555 

Thr 

His 

His 
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Trp 

Met 

3560 

Leu 

Leu 

Thr 

Phe 

Leu 

3565 

Thr 

Ser 

Leu 

Leu 

Ile 

3570 

Leu 

Val 

Gin 

Ser 

Thr 

3575 

Gin 

Trp 

Ser 

Leu 

Phe 

3580 

Phe 

Phe 

Val 

Tyr 

Glu 

3585 

Asn 

Ala 

Phe 

Leu 

Pro 

3590 

Phe 

Thr 

Leu 

Gly 

Ile 

3595 

Met 

Ala 

Ile 

Ala 

Ala 

3600 

Cys 

Ala 

Met 

Leu 

Leu 

3605 

Val 

Lys 

His 

Lys 

His 

3610 

Ala 

Phe 

Leu 

Cys 

Leu 

3615 

Phe 

Leu 

Leu 

Pro 

Ser 

3620 

Leu 

Ala 

Thr 

Val 

Ala 

3625 

Tyr 

Phe 

Asn 

Met 

Val 

3630 

Tyr 

Met 

Pro 

Ala 

Ser 

3635 

Trp 

Val 

Met 

Arg 

Ile 

3640 

Met 

Thr 

Trp 

Leu 

Glu 

3645 

Leu 

Ala 

Asp 

Thr 

Ser 

3650 

Leu 

Ser 

Gly 

Tyr 

Arg 

3655 

Leu 

Lys 

Asp 

Cys 

Val 

3660 

Met 

Tyr 

Ala 

Ser 

Ala 

3665 

Leu 

Val 

Leu 

Leu 

Ile 

3670 

Leu 

Met 

Thr 

Ala 

Arg 

3675 

Thr 

Val 

Tyr 

Asp 

Asp 

3680 

Ala 

Ala 

Arg 

Arg 

Val 

3685 

Trp 

Thr 

Leu 

Met 

Asn 

3690 

Val 

Ile 

Thr 

Leu 

Val 

3695 

Tyr 

Lys 

Val 

Tyr 

Tyr 

3700 

Gly Asn 

Ala 

Leu 

Asp 

3705 

Gin 

Ala 

Ile 

Ser 

Met 

3710 

Trp 

Ala 

Leu 

Val 

Ile 

3715 

Ser 

Val 

Thr 

Ser 

Asn 

3720 

Tyr 

Ser 

Gly 

Val 

Val 

3725 

Thr 

Thr 

Ile 

Met 

Phe 

3730 

Leu 

Ala 

Arg 

Ala 

Ile 

3735 

Val 

Phe 

Val 

Cys 

Val 

3740 

Glu 

Tyr 

Tyr 

Pro 

Leu 

3745 

Leu 

Phe 

Ile 

Thr Gly 
3750 

Asn 

Thr 

Leu 

Gin 

Cys 

3755 

Ile 

Met 

Leu 

Val 

Tyr 

3760 

Cys 

Phe 

Leu 

Gly Tyr 
3765 

Cys 

Cys 

Cys 

Cys 

Tyr 

3770 

Phe 

Gly 

Leu 

Phe 

Cys 

3775 

Leu 

Leu 

Asn 

Arg 

Tyr 

3780 

Phe 

Arg 

Leu 

Thr 

Leu 

3785 

Gly Val 

Tyr 

Asp 

Tyr 

3790 

Leu 

Val 

Ser 

Thr 

Gin 

3795 

Glu 

Phe 

Arg 

Tyr 

Met 

3800 

Asn 

Ser 

Gin 

Gly 

Leu 

3805 

Leu 

Pro 

Pro 

Lys 

Ser 

3810 

Ser 

Ile 

Asp 

Ala 

Phe 

3815 

Lys 

Leu 

Asn 

Ile 

Lys 

3820 

Leu 

Leu 

Gly 

Ile 

Gly 

3825 

Gly Lys 

Pro 

Cys 

Ile 

3830 

Lys 

Val 

Ala 

Thr 

Val 

3835 

Gin 

Ser 

Lys 

Met 

Ser 

3840 

Asp 

Val 

Lys 

Cys 

Thr 

3845 

Ser 

Val 

Val 

Leu 

Leu 

3850 

Ser 

Val 

Leu 

Gin 

Gin 

3855 

Leu 

Arg 

Val 

Glu 

Ser 

3860 

Ser 

Ser 

Lys 

Leu 

Trp 

3865 

Ala 

Gin 

Cys 

Val 

Gin 

3870 

Leu 

His 

Asn 

Asp 

Ile 

3875 

Leu 

Leu 

Ala 

Lys 

Asp 

3880 

Thr 

Thr 

Glu 

Ala 

Phe 

3885 

Glu 

Lys 

Met 

Val 

Ser 

3890 

Leu 

Leu 

Ser 

Val 

Leu 

3895 

Leu 

Ser 

Met 

Gin 

Gly 

3900 

Ala 

Val 

Asp 

Ile 

Asn 

3905 

Arg 

Leu 

Cys 

Glu 

Glu 

3910 

Met 

Leu 

Asp 

Asn 

Arg 

3915 

Ala 

Thr 

Leu 

Gin 

Ala 

3920 

Ile 

Ala 

Ser 

Glu 

Phe 

3925 

Ser 

Ser 

Leu 

Pro 

Ser 

3930 

Tyr 

Ala 

Ala 

Tyr 

Ala 

3935 

Thr 

Ala 

Gin 

Glu 

Ala 

3940 

Tyr 

Glu 

Gin 

Ala 

Val 

3945 

Ala 

Asn 

Gly 
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Asp 

Ser 

3950 

Glu 

Val 

Val 

Leu 

Lys 

3955 

Lys 

Leu 

Lys 

Lys 

Ser 

3960 

Leu 

Asn 

Val 

Ala 

Lys 

3965 

Ser 

Glu 

Phe 

Asp 

Arg 

3970 

Asp 

Ala 

Ala 

Met 

Gin 

3975 

Arg 

Lys 

Leu 

Glu 

Lys 

3980 

Met 

Ala 

Asp 

Gin 

Ala 

3985 

Met 

Thr 

Gin 

Met 

Tyr 

3990 

Lys 

Gin 

Ala 

Arg 

Ser 

3995 

Glu 

Asp 

Lys 

Arg 

Ala 

4000 

Lys 

Val 

Thr 

Ser 

Ala 

4005 

Met 

Gin 

Thr 

Met 

Leu 

4010 

Phe 

Thr 

Met 

Leu 

Arg 

4015 

Lys 

Leu 

Asp 

Asn 

Asp 

4020 

Ala 

Leu 

Asn 

Asn 

Ile 

4025 

Ile 

Asn 

Asn 

Ala 

Arg 

4030 

Asp 

Gly Cys 

Val 

Pro 

4035 

Leu 

Asn 

Ile 

Ile 

Pro 

4040 

Leu 

Thr 

Thr 

Ala 

Ala 

4045 

Lys 

Leu 

Met 

Val 

Val 

4050 

Val 

Pro 

Asp 

Tyr 

Gly 

4055 

Thr 

Tyr 

Lys 

Asn 

Thr 

4060 

Cys 

Asp 

Gly 

Asn 

Thr 

4065 

Phe 

Thr 

Tyr 

Ala 

Ser 

4070 

Ala 

Leu 

Trp 

Glu 

Ile 

4075 

Gin 

Gin 

Val 

Val 

Asp 

4080 

Ala 

Asp 

Ser 

Lys 

Ile 

4085 

Val 

Gin 

Leu 

Ser 

Glu 

4090 

Ile 

Asn 

Met 

Asp 

Asn 

4095 

Ser 

Pro 

Asn 

Leu 

Ala 

4100 

Trp 

Pro 

Leu 

Ile 

Val 

4105 

Thr 

Ala 

Leu 

Arg 

Ala 

4110 

Asn 

Ser 

Ala 

Val 

Lys 

4115 

Leu 

Gin 

Asn 

Asn 

Glu 

4120 

Leu 

Ser 

Pro 

Val 

Ala 

4125 

Leu 

Arg 

Gin 

Met 

Ser 

4130 

Cys 

Ala 

Ala 

Gly 

Thr 

4135 

Thr 

Gin 

Thr 

Ala 

Cys 

4140 

Thr 

Asp Asp 

Asn 

Ala 

4145 

Leu 

Ala 

Tyr 

Tyr 

Asn 

4150 

Asn 

Ser 

Lys 

Gly 

Gly 

4155 

Arg 

Phe 

Val 

Leu 

Ala 

4160 

Leu 

Leu 

Ser 

Asp 

His 

4165 

Gin 

Asp 

Leu 

Lys 

Trp 

4170 

Ala 

Arg 

Phe 

Pro 

Lys 

4175 

Ser 

Asp 

Gly 

Thr 

Gly 

4180 

Thr 

Ile 

Tyr 

Thr 

Glu 

4185 

Leu 

Glu 

Pro 

Pro 

Cys 

4190 

Arg 

Phe 

Val 

Thr 

Asp 

4195 

Thr 

Pro 

Lys 

Gly 

Pro 

4200 

Lys 

Val 

Lys 

Tyr 

Leu 

4205 

Tyr 

Phe 

Ile 

Lys 

Gly 

4210 

Leu 

Asn 

Asn 

Leu 

Asn 

4215 

Arg 

Gly Met 

Val 

Leu 

4220 

Gly 

Ser 

Leu 

Ala 

Ala 

4225 

Thr 

Val 

Arg 

Leu 

Gin 

4230 

Ala 

Gly Asn 

Ala 

Thr 

4235 

Glu 

Val 

Pro 

Ala 

Asn 

4240 

Ser 

Thr 

Val 

Leu 

Ser 

4245 

Phe 

Cys 

Ala 

Phe 

Ala 

4250 

Val 

Asp 

Pro 

Ala 

Lys 

4255 

Ala 

Tyr 

Lys 

Asp 

Tyr 

4260 

Leu 

Ala 

Ser 

Gly 

Gly 

4265 

Gin 

Pro 

Ile 

Thr 

Asn 

4270 

Cys 

Val 

Lys 

Met 

Leu 

4275 

Cys 

Thr 

His 

Thr 

Gly 

4280 

Thr 

Gly 

Gin 

Ala 

Ile 

4285 

Thr 

Val 

Thr 

Pro 

Glu 

4290 

Ala 

Asn 

Met 

Asp 

Gin 

4295 

Glu 

Ser 

Phe 

Gly 

Gly 

4300 

Ala 

Ser 

Cys 

Cys 

Leu 

4305 

Tyr 

Cys 

Arg 

Cys 

His 

4310 

Ile 

Asp 

His 

Pro 

Asn 

4315 

Pro 

Lys 

Gly 

Phe 

Cys 

4320 

Asp 

Leu 

Lys 

Gly 

Lys 

4325 

Tyr 

Val 

Gin 

Ile 

Pro 

4330 

Thr 

Thr 

Cys 

Ala 

Asn 

4335 

Asp 

Pro 

Val 

Gly 

Phe 

Thr 

Leu 

Arg 

Asn 

Thr 

Val 

Cys 

Thr 

Val 

Cys 

Gly 

Met 

Trp 
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4340 




4345 



4350 



Lys 

Gly 

4355 

Tyr 

Gly 

Cys 

Ser Cys 
4360 

Asp 

Gin 

Leu Arg Glu 
4365 

Pro 

Leu Met 

Gin 

Ser 

4370 

Ala 

Asp 

Ala 

Ser Thr 
4375 

Phe 

Leu 

Asn Gly Phe 
4380 

Ala 

Val 


<210> SEQ ID NO 3 

<211> LENGTH: 2695 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 3 


Arg 

Val 

Cys 

Gly 

Val 

Ser 

Ala 

Ala 

Arg 

Leu 

Thr 

Pro 

Cys 

Gly 

Thr 

Gly 

1 




5 





10 





15 


Thr 

Ser 

Thr 

Asp 

Val 

Val 

Tyr 

Arg 

Ala 

Phe 

Asp 

Ile 

Tyr 

Asn 

Glu 

Lys 




20 





25 





30 



Val 

Ala 

Gly 

Phe 

Ala 

Lys 

Phe 

Leu 

Lys 

Thr 

Asn 

Cys 

Cys 

Arg 

Phe 

Gin 



35 





40 





45 




Glu 

Lys 

Asp 

Glu 

Glu 

Gly 

Asn 

Leu 

Leu 

Asp 

Ser 

Tyr 

Phe 

Val 

Val 

Lys 


50 





55 





60 





Arg 

His 

Thr 

Met 

Ser 

Asn 

Tyr 

Gin 

His 

Glu 

Glu 

Thr 

Ile 

Tyr 

Asn 

Leu 

65 





70 





75 





80 

Val 

Lys 

Asp 

Cys 

Pro 

Ala 

Val 

Ala 

Val 

His 

Asp 

Phe 

Phe 

Lys 

Phe 

Arg 





85 





90 





95 


Val 

Asp 

Gly 

Asp 

Met 

Val 

Pro 

His 

Ile 

Ser 

Arg 

Gin 

Arg 

Leu 

Thr 

Lys 




100 





105 





110 



Tyr 

Thr 

Met 

Ala 

Asp 

Leu 

Val 

Tyr 

Ala 

Leu 

Arg 

His 

Phe 

Asp 

Glu Gly 



115 





120 





125 




Asn 

Cys 

Asp 

Thr 

Leu 

Lys 

Glu 

Ile 

Leu 

Val 

Thr 

Tyr 

Asn 

Cys 

Cys 

Asp 


130 





135 





140 





Asp 

Asp 

Tyr 

Phe 

Asn 

Lys 

Lys 

Asp 

Trp 

Tyr 

Asp 

Phe 

Val 

Glu 

Asn 

Pro 

145 





150 





155 





160 

Asp 

Ile 

Leu 

Arg 

Val 

Tyr 

Ala 

Asn 

Leu 

Gly 

Glu 

Arg 

Val 

Arg 

Gin 

Ser 





165 





170 





175 


Leu 

Leu 

Lys 

Thr 

Val 

Gin 

Phe 

Cys 

Asp 

Ala 

Met 

Arg 

Asp 

Ala 

Gly 

Ile 




180 





185 





190 



Val 

Gly 

Val 

Leu 

Thr 

Leu 

Asp 

Asn 

Gin 

Asp 

Leu 

Asn 

Gly 

Asn 

Trp 

Tyr 



195 





200 





205 




Asp 

Phe 

Gly 

Asp 

Phe 

Val 

Gin 

Val 

Ala 

Pro 

Gly 

Cys 

Gly 

Val 

Pro 

Ile 


210 





215 





220 





Val 

Asp 

Ser 

Tyr 

Tyr 

Ser 

Leu 

Leu 

Met 

Pro 

Ile 

Leu 

Thr 

Leu 

Thr 

Arg 

225 





230 





235 





240 

Ala 

Leu 

Ala 

Ala 

Glu 

Ser 

His 

Met 

Asp 

Ala 

Asp 

Leu 

Ala 

Lys 

Pro 

Leu 





245 





250 





255 


Ile 

Lys 

Trp 

Asp 

Leu 

Leu 

Lys 

Tyr 

Asp 

Phe 

Thr 

Glu 

Glu 

Arg 

Leu 

Cys 




260 





265 





270 



Leu 

Phe 

Asp 

Arg 

Tyr 

Phe 

Lys 

Tyr 

Trp 

Asp 

Gin 

Thr 

Tyr 

His 

Pro 

Asn 



275 





280 





285 




Cys 

Ile 

Asn 

Cys 

Leu 

Asp 

Asp 

Arg 

Cys 

Ile 

Leu 

His 

Cys 

Ala 

Asn 

Phe 


290 





295 





300 





Asn 

Val 

Leu 

Phe 

Ser 

Thr 

Val 

Phe 

Pro 

Pro 

Thr 

Ser 

Phe 

Gly 

Pro 

Leu 

305 





310 





315 





320 

Val 

Arg 

Lys 

Ile 

Phe 

Val 

Asp 

Gly 

Val 

Pro 

Phe 

Val 

Val 

Ser 

Thr 

Gly 





325 





330 





335 





US 7,220,852 Bl 

93 94 


-continued 


Tyr 

His 

Phe 

Arg 

Glu 

Leu 

Gly 

Val 

Val 

His 

Asn 

Gin 

Asp 

Val 

Asn 

Leu 




340 





345 





350 



His 

Ser 

Ser 

Arg 

Leu 

Ser 

Phe 

Lys 

Glu 

Leu 

Leu 

Val 

Tyr 

Ala 

Ala 

Asp 



355 





360 





365 




Pro 

Ala 

Met 

His 

Ala 

Ala 

Ser 

Gly 

Asn 

Leu 

Leu 

Leu 

Asp 

Lys 

Arg 

Thr 


370 





375 





380 





Thr 

Cys 

Phe 

Ser 

Val 

Ala 

Ala 

Leu 

Thr 

Asn 

Asn 

Val 

Ala 

Phe 

Gin 

Thr 

385 





390 





395 





400 

Val 

Lys 

Pro 

Gly 

Asn 

Phe 

Asn 

Lys 

Asp 

Phe 

Tyr 

Asp 

Phe 

Ala 

Val 

Ser 





405 





410 





415 


Lys 

Gly 

Phe 

Phe 

Lys 

Glu 

Gly 

Ser 

Ser 

Val 

Glu 

Leu 

Lys 

His 

Phe 

Phe 




420 





425 





430 



Phe 

Ala 

Gin 

Asp 

Gly 

Asn 

Ala 

Ala 

Ile 

Ser 

Asp 

Tyr 

Asp 

Tyr 

Tyr 

Arg 



435 





440 





445 




Tyr 

Asn 

Leu 

Pro 

Thr 

Met 

Cys 

Asp 

Ile 

Arg 

Gin 

Leu 

Leu 

Phe 

Val 

Val 


450 





455 





460 





Glu 

Val 

Val 

Asp 

Lys 

Tyr 

Phe 

Asp 

Cys 

Tyr 

Asp 

Gly 

Gly 

Cys 

Ile 

Asn 

465 





470 





475 





480 

Ala 

Asn 

Gin 

Val 

Ile 

Val 

Asn 

Asn 

Leu 

Asp 

Lys 

Ser 

Ala 

Gly 

Phe 

Pro 





485 





490 





495 


Phe 

Asn 

Lys 

Trp 

Gly 

Lys 

Ala 

Arg 

Leu 

Tyr 

Tyr 

Asp 

Ser 

Met 

Ser 

Tyr 




500 





505 





510 



Glu 

Asp 

Gin 

Asp 

Ala 

Leu 

Phe 

Ala 

Tyr 

Thr 

Lys 

Arg 

Asn 

Val 

Ile 

Pro 



515 





520 





525 




Thr 

Ile 

Thr 

Gin 

Met 

Asn 

Leu 

Lys 

Tyr 

Ala 

Ile 

Ser 

Ala 

Lys 

Asn 

Arg 


530 





535 





540 





Ala 

Arg 

Thr 

Val 

Ala 

Gly 

Val 

Ser 

Ile 

Cys 

Ser 

Thr 

Met 

Thr 

Asn 

Arg 

545 





550 





555 





560 

Gin 

Phe 

His 

Gin 

Lys 

Leu 

Leu 

Lys 

Ser 

Ile 

Ala 

Ala 

Thr 

Arg 

Gly 

Ala 





565 





570 





575 


Thr 

Val 

Val 

Ile 

Gly 

Thr 

Ser 

Lys 

Phe 

Tyr 

Gly 

Gly 

Trp 

His 

Asn 

Met 




580 





585 





590 



Leu 

Lys 

Thr 

Val 

Tyr 

Ser 

Asp 

Val 

Glu 

Thr 

Pro 

His 

Leu 

Met 

Gly 

Trp 



595 





600 





605 




Asp 

Tyr 

Pro 

Lys 

Cys 

Asp 

Arg 

Ala 

Met 

Pro 

Asn 

Met 

Leu 

Arg 

Ile 

Met 


610 





615 





620 





Ala 

Ser 

Leu 

Val 

Leu 

Ala 

Arg 

Lys 

His 

Asn 

Thr 

Cys 

Cys 

Asn 

Leu 

Ser 

625 





630 





635 





640 

His 

Arg 

Phe 

Tyr 

Arg 

Leu 

Ala 

Asn 

Glu 

Cys 

Ala 

Gin 

Val 

Leu 

Ser 

Glu 





645 





650 





655 


Met 

Val 

Met 

Cys 

Gly 

Gly 

Ser 

Leu 

Tyr 

Val 

Lys 

Pro 

Gly 

Gly 

Thr 

Ser 




660 





665 





670 



Ser 

Gly 

Asp 

Ala 

Thr 

Thr 

Ala 

Tyr 

Ala 

Asn 

Ser 

Val 

Phe 

Asn 

Ile 

Cys 



675 





680 





685 




Gin 

Ala 

Val 

Thr 

Ala 

Asn 

Val 

Asn 

Ala 

Leu 

Leu 

Ser 

Thr 

Asp 

Gly 

Asn 


690 





695 





700 





Lys 

Ile 

Ala 

Asp 

Lys 

Tyr 

Val 

Arg 

Asn 

Leu 

Gin 

His 

Arg 

Leu 

Tyr 

Glu 

705 





710 





715 





720 

Cys 

Leu 

Tyr 

Arg 

Asn 

Arg 

Asp 

Val 

Asp 

His 

Glu 

Phe 

Val 

Asp 

Glu 

Phe 





725 





730 





735 


Tyr 

Ala 

Tyr 

Leu 

Arg 

Lys 

His 

Phe 

Ser 

Met 

Met 

Ile 

Leu 

Ser 

Asp 

Asp 




740 





745 





750 



Ala 

Val 

Val 

Cys 

Tyr 

Asn 

Ser 

Asn 

Tyr 

Ala 

Ala 

Gin 

Gly 

Leu 

Val 

Ala 
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755 760 765 


Ser 

Ile 

Lys 

Asn 

Phe 

Lys 

Ala 

Val 

Leu 

Tyr 

Tyr 

Gin 

Asn 

Asn 

Val 

Phe 


770 





775 





780 





Met 

Ser 

Glu 

Ala 

Lys 

Cys 

Trp 

Thr 

Glu 

Thr 

Asp 

Leu 

Thr 

Lys 

Gly 

Pro 

785 





790 





795 





800 

His 

Glu 

Phe 

Cys 

Ser 

Gin 

His 

Thr 

Met 

Leu 

Val 

Lys 

Gin 

Gly 

Asp 

Asp 





805 





810 





815 


Tyr 

Val 

Tyr 

Leu 

Pro 

Tyr 

Pro 

Asp 

Pro 

Ser 

Arg 

Ile 

Leu 

Gly 

Ala 

Gly 




820 





825 





830 



Cys 

Phe 

Val 

Asp 

Asp 

Ile 

Val 

Lys 

Thr 

Asp 

Gly 

Thr 

Leu 

Met 

Ile 

Glu 



835 





840 





845 




Arg 

Phe 

Val 

Ser 

Leu 

Ala 

Ile 

Asp 

Ala 

Tyr 

Pro 

Leu 

Thr 

Lys 

His 

Pro 


850 





855 





860 





Asn 

Gin 

Glu 

Tyr 

Ala 

Asp 

Val 

Phe 

His 

Leu 

Tyr 

Leu 

Gin 

Tyr 

Ile 

Arg 

865 





870 





875 





880 

Lys 

Leu 

His 

Asp 

Glu 

Leu 

Thr 

Gly 

His 

Met 

Leu 

Asp 

Met 

Tyr 

Ser 

Val 





885 





890 





895 


Met 

Leu 

Thr 

Asn 

Asp 

Asn 

Thr 

Ser 

Arg 

Tyr 

Trp 

Glu 

Pro 

Glu 

Phe 

Tyr 




900 





905 





910 



Glu 

Ala 

Met 

Tyr 

Thr 

Pro 

His 

Thr 

Val 

Leu 

Gin 

Ala 

Val 

Gly 

Ala 

Cys 



915 





920 





925 




Val 

Leu 

Cys 

Asn 

Ser 

Gin 

Thr 

Ser 

Leu 

Arg 

Cys 

Gly 

Ala 

Cys 

Ile 

Arg 


930 





935 





940 





Arg 

Pro 

Phe 

Leu 

Cys 

Cys 

Lys 

Cys 

Cys 

Tyr 

Asp 

His 

Val 

Ile 

Ser 

Thr 

945 





950 





955 





960 

Ser 

His 

Lys 

Leu 

Val 

Leu 

Ser 

Val 

Asn 

Pro 

Tyr 

Val 

Cys 

Asn 

Ala 

Pro 





965 





970 





975 


Gly Cys 

Asp 

Val 

Thr 

Asp 

Val 

Thr 

Gin 

Leu 

Tyr 

Leu 

Gly 

Gly 

Met 

Ser 




980 





985 





990 




Tyr Tyr Cys Lys Ser His Lys Pro Pro Ile Ser Phe Pro Leu Cys Ala 
995 1000 1005 


Asn 

Gly 

1010 

Gin 

Val 

Phe 

Gly 

Leu 

1015 

Tyr 

Lys 

Asn 

Thr 

Cys 

1020 

Val 

Gly 

Ser 

Asp 

Asn 

1025 

Val 

Thr 

Asp 

Phe 

Asn 

1030 

Ala 

Ile 

Ala 

Thr 

Cys 

1035 

Asp 

Trp 

Thr 

Asn 

Ala 

1040 

Gly 

Asp 

Tyr 

Ile 

Leu 

1045 

Ala 

Asn 

Thr 

Cys 

Thr 

1050 

Glu 

Arg 

Leu 

Lys 

Leu 

1055 

Phe 

Ala 

Ala 

Glu 

Thr 

1060 

Leu 

Lys 

Ala 

Thr 

Glu 

1065 

Glu 

Thr 

Phe 

Lys 

Leu 

1070 

Ser 

Tyr 

Gly 

Ile 

Ala 

1075 

Thr 

Val 

Arg 

Glu 

Val 

1080 

Leu 

Ser 

Asp 

Arg 

Glu 

1085 

Leu 

His 

Leu 

Ser 

Trp 

1090 

Glu 

Val 

Gly 

Lys 

Pro 

1095 

Arg 

Pro 

Pro 

Leu 

Asn 

1100 

Arg 

Asn 

Tyr 

Val 

Phe 

1105 

Thr 

Gly 

Tyr 

Arg 

Val 

1110 

Thr 

Lys 

Asn 

Ser 

Lys 

1115 

Val 

Gin 

Ile 

Gly 

Glu 

1120 

Tyr 

Thr 

Phe 

Glu 

Lys 

1125 

Gly 

Asp 

Tyr 

Gly Asp 
1130 

Ala 

Val 

Val 

Tyr 

Arg 

1135 

Gly 

Thr 

Thr 

Thr 

Tyr 

1140 

Lys 

Leu 

Asn 

Val 

Gly 

1145 

Asp 

Tyr 

Phe 

Val 

Leu 

1150 

Thr 

Ser 

His 

Thr 

Val 

1155 

Met 

Pro 

Leu 


Ser Ala Pro Thr Leu Val Pro Gin Glu His Tyr Val Arg Ile Thr 
1160 1165 1170 
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Gly 

Leu 

1175 

Tyr 

Pro 

Thr 

Leu 

Asn 

1180 

Ile 

Ser 

Asp 

Glu 

Phe 

1185 

Ser 

Ser 

Asn 

Val 

Ala 

1190 

Asn 

Tyr 

Gin 

Lys 

Val 

1195 

Gly 

Met 

Gin 

Lys 

Tyr 

1200 

Ser 

Thr 

Leu 

Gin 

Gly 

1205 

Pro 

Pro 

Gly 

Thr 

Gly 

1210 

Lys 

Ser 

His 

Phe 

Ala 

1215 

Ile 

Gly 

Leu 

Ala 

Leu 

1220 

Tyr 

Tyr 

Pro 

Ser 

Ala 

1225 

Arg 

Ile 

Val 

Tyr 

Thr 

1230 

Ala 

Cys 

Ser 

His 

Ala 

1235 

Ala 

Val 

Asp 

Ala 

Leu 

1240 

Cys 

Glu 

Lys 

Ala 

Leu 

1245 

Lys 

Tyr 

Leu 

Pro 

Ile 

1250 

Asp 

Lys 

Cys 

Ser 

Arg 

1255 

Ile 

Ile 

Pro 

Ala 

Arg 

1260 

Ala 

Arg 

Val 

Glu 

Cys 

1265 

Phe 

Asp 

Lys 

Phe 

Lys 

1270 

Val 

Asn 

Ser 

Thr 

Leu 

1275 

Glu 

Gin 

Tyr 

Val 

Phe 

1280 

Cys 

Thr 

Val 

Asn 

Ala 

1285 

Leu 

Pro 

Glu 

Thr 

Thr 

1290 

Ala 

Asp 

Ile 

Val 

Val 

1295 

Phe 

Asp 

Glu 

Ile 

Ser 

1300 

Met 

Ala 

Thr 

Asn 

Tyr 

1305 

Asp 

Leu 

Ser 

Val 

Val 

1310 

Asn 

Ala 

Arg 

Leu 

Arg 

1315 

Ala 

Lys 

His 

Tyr 

Val 

1320 

Tyr 

Ile Gly 

Asp 

Pro 

1325 

Ala 

Gin 

Leu 

Pro 

Ala 

1330 

Pro 

Arg 

Thr 

Leu 

Leu 

1335 

Thr 

Lys 

Gly 

Thr 

Leu 

1340 

Glu 

Pro 

Glu 

Tyr 

Phe 

1345 

Asn 

Ser 

Val 

Cys 

Arg 

1350 

Leu 

Met 

Lys 

Thr 

Ile 

1355 

Gly 

Pro 

Asp 

Met 

Phe 

1360 

Leu 

Gly Thr 

Cys 

Arg 

1365 

Arg 

Cys 

Pro 

Ala 

Glu 

1370 

Ile 

Val 

Asp 

Thr 

Val 

1375 

Ser 

Ala 

Leu 

Val 

Tyr 

1380 

Asp 

Asn 

Lys 

Leu 

Lys 

1385 

Ala 

His 

Lys 

Asp 

Lys 

1390 

Ser 

Ala 

Gin 

Cys 

Phe 

1395 

Lys 

Met 

Phe 

Tyr 

Lys 

1400 

Gly Val 

Ile 

Thr 

His 

1405 

Asp 

Val 

Ser 

Ser 

Ala 

1410 

Ile 

Asn 

Arg 

Pro 

Gin 

1415 

Ile 

Gly Val 

Val 

Arg 

1420 

Glu 

Phe 

Leu 

Thr 

Arg 

1425 

Asn 

Pro 

Ala 

Trp 

Arg 

1430 

Lys 

Ala 

Val 

Phe 

Ile 

1435 

Ser 

Pro 

Tyr 

Asn 

Ser 

1440 

Gin 

Asn 

Ala 

Val 

Ala 

1445 

Ser 

Lys 

Ile 

Leu 

Gly 

1450 

Leu 

Pro 

Thr 

Gin 

Thr 

1455 

Val 

Asp 

Ser 

Ser 

Gin 

1460 

Gly 

Ser 

Glu 

Tyr 

Asp 

1465 

Tyr 

Val 

Ile 

Phe 

Thr 

1470 

Gin 

Thr 

Thr 

Glu 

Thr 

1475 

Ala 

His 

Ser 

Cys 

Asn 

1480 

Val 

Asn 

Arg 

Phe 

Asn 

1485 

Val 

Ala 

Ile 

Thr 

Arg 

1490 

Ala 

Lys 

Ile 

Gly 

Ile 

1495 

Leu 

Cys 

Ile 

Met 

Ser 

1500 

Asp 

Arg 

Asp 

Leu 

Tyr 

1505 

Asp 

Lys 

Leu 

Gin 

Phe 

1510 

Thr 

Ser 

Leu 

Glu 

Ile 

1515 

Pro 

Arg 

Arg 

Asn 

Val 

1520 

Ala 

Thr 

Leu 

Gin 

Ala 

1525 

Glu 

Asn 

Val 

Thr 

Gly 

1530 

Leu 

Phe 

Lys 

Asp 

Cys 

1535 

Ser 

Lys 

Ile 

Ile 

Thr 

1540 

Gly 

Leu 

His 

Pro 

Thr 

1545 

Gin 

Ala 

Pro 

Thr 

His 

1550 

Leu 

Ser 

Val 

Asp 

Ile 

1555 

Lys 

Phe 

Lys 

Thr 

Glu 

1560 

Gly 

Leu 

Cys 
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Val 

Asp 

1565 

Ile 

Pro 

Gly 

Ile 

Pro 

1570 

Lys 

Asp 

Met 

Thr 

Tyr 

1575 

Arg 

Arg 

Leu 

Ile 

Ser 

1580 

Met 

Met 

Gly 

Phe 

Lys 

1585 

Met 

Asn 

Tyr 

Gin 

Val 

1590 

Asn 

Gly Tyr 

Pro 

Asn 

1595 

Met 

Phe 

Ile 

Thr 

Arg 

1600 

Glu 

Glu 

Ala 

Ile 

Arg 

1605 

His 

Val 

Arg 

Ala 

Trp 

1610 

Ile 

Gly 

Phe 

Asp 

Val 

1615 

Glu 

Gly Cys 

His 

Ala 

1620 

Thr 

Arg 

Asp 

Ala 

Val 

1625 

Gly 

Thr 

Asn 

Leu 

Pro 

1630 

Leu 

Gin 

Leu 

Gly 

Phe 

1635 

Ser 

Thr 

Gly 

Val 

Asn 

1640 

Leu 

Val 

Ala 

Val 

Pro 

1645 

Thr 

Gly Tyr 

Val 

Asp 

1650 

Thr 

Glu 

Asn 

Asn 

Thr 

1655 

Glu 

Phe 

Thr 

Arg 

Val 

1660 

Asn 

Ala 

Lys 

Pro 

Pro 

1665 

Pro 

Gly Asp 

Gin 

Phe 

1670 

Lys 

His 

Leu 

Ile 

Pro 

1675 

Leu 

Met 

Tyr 

Lys 

Gly 

1680 

Leu 

Pro 

Trp 

Asn 

Val 

1685 

Val 

Arg 

Ile 

Lys 

Ile 

1690 

Val 

Gin 

Met 

Leu 

Ser 

1695 

Asp 

Thr 

Leu 

Lys 

Gly 

1700 

Leu 

Ser 

Asp 

Arg 

Val 

1705 

Val 

Phe 

Val 

Leu 

Trp 

1710 

Ala 

His 

Gly 

Phe 

Glu 

1715 

Leu 

Thr 

Ser 

Met 

Lys 

1720 

Tyr 

Phe 

Val 

Lys 

Ile 

1725 

Gly 

Pro 

Glu 

Arg 

Thr 

1730 

Cys 

Cys 

Leu 

Cys 

Asp 

1735 

Lys 

Arg 

Ala 

Thr 

Cys 

1740 

Phe 

Ser 

Thr 

Ser 

Ser 

1745 

Asp 

Thr 

Tyr 

Ala 

Cys 

1750 

Trp 

Asn 

His 

Ser 

Val 

1755 

Gly 

Phe 

Asp 

Tyr 

Val 

1760 

Tyr 

Asn 

Pro 

Phe 

Met 

1765 

Ile 

Asp 

Val 

Gin 

Gin 

1770 

Trp 

Gly Phe 

Thr 

Gly 

1775 

Asn 

Leu 

Gin 

Ser 

Asn 

1780 

His 

Asp 

Gin 

His 

Cys 

1785 

Gin 

Val 

His 

Gly 

Asn 

1790 

Ala 

His 

Val 

Ala 

Ser 

1795 

Cys 

Asp 

Ala 

Ile 

Met 

1800 

Thr 

Arg 

Cys 

Leu 

Ala 

1805 

Val 

His 

Glu 

Cys 

Phe 

1810 

Val 

Lys 

Arg 

Val 

Asp 

1815 

Trp 

Ser 

Val 

Glu 

Tyr 

1820 

Pro 

Ile 

Ile 

Gly 

Asp 

1825 

Glu 

Leu 

Arg 

Val 

Asn 

1830 

Ser 

Ala 

Cys 

Arg 

Lys 

1835 

Val 

Gin 

His 

Met 

Val 

1840 

Val 

Lys 

Ser 

Ala 

Leu 

1845 

Leu 

Ala 

Asp 

Lys 

Phe 

1850 

Pro 

Val 

Leu 

His 

Asp 

1855 

Ile 

Gly Asn 

Pro 

Lys 

1860 

Ala 

Ile 

Lys 

Cys 

Val 

1865 

Pro 

Gin 

Ala 

Glu 

Val 

1870 

Glu 

Trp 

Lys 

Phe 

Tyr 

1875 

Asp 

Ala 

Gin 

Pro 

Cys 

1880 

Ser 

Asp 

Lys 

Ala 

Tyr 

1885 

Lys 

Ile 

Glu 

Glu 

Leu 

1890 

Phe 

Tyr 

Ser 

Tyr 

Ala 

1895 

Thr 

His 

His 

Asp 

Lys 

1900 

Phe 

Thr 

Asp 

Gly 

Val 

1905 

Cys 

Leu 

Phe 

Trp 

Asn 

1910 

Cys 

Asn 

Val 

Asp 

Arg 

1915 

Tyr 

Pro 

Ala 

Asn 

Ala 

1920 

Ile 

Val 

Cys 

Arg 

Phe 

1925 

Asp 

Thr 

Arg 

Val 

Leu 

1930 

Ser 

Asn 

Leu 

Asn 

Leu 

1935 

Pro 

Gly Cys 

Asp 

Gly 

1940 

Gly 

Ser 

Leu 

Tyr 

Val 

1945 

Asn 

Lys 

His 

Ala 

Phe 

1950 

His 

Thr 

Pro 

Ala 

Phe 

Asp 

Lys 

Ser 

Ala 

Phe 

Thr 

Asn 

Leu 

Lys 

Gin 

Leu 

Pro 

Phe 
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1955 





1960 





1965 




Phe 

Tyr 

1970 

Tyr 

Ser 

Asp 

Ser 

Pro 

1975 

Cys 

Glu 

Ser 

His 

Gly 

1980 

Lys 

Gin 

Val 

Val 

Ser 

1985 

Asp 

Ile 

Asp 

Tyr 

Val 

1990 

Pro 

Leu 

Lys 

Ser 

Ala 

1995 

Thr 

Cys 

Ile 

Thr 

Arg 

2000 

Cys 

Asn 

Leu 

Gly 

Gly 

2005 

Ala 

Val 

Cys 

Arg 

His 

2010 

His 

Ala 

Asn 

Glu 

Tyr 

2015 

Arg 

Gin 

Tyr 

Leu 

Asp 

2020 

Ala 

Tyr 

Asn 

Met 

Met 

2025 

Ile 

Ser 

Ala 

Gly 

Phe 

2030 

Ser 

Leu 

Trp 

Ile 

Tyr 

2035 

Lys 

Gin 

Phe 

Asp 

Thr 

2040 

Tyr 

Asn 

Leu 

Trp 

Asn 

2045 

Thr 

Phe 

Thr 

Arg 

Leu 

2050 

Gin 

Ser 

Leu 

Glu 

Asn 

2055 

Val 

Ala 

Tyr 

Asn 

Val 

2060 

Val 

Asn 

Lys 

Gly 

His 

2065 

Phe 

Asp 

Gly 

His 

Ala 

2070 

Gly 

Glu 

Ala 

Pro 

Val 

2075 

Ser 

Ile 

Ile 

Asn 

Asn 

2080 

Ala 

Val 

Tyr 

Thr 

Lys 

2085 

Val 

Asp Gly 

Ile 

Asp 

2090 

Val 

Glu 

Ile 

Phe 

Glu 

2095 

Asn 

Lys 

Thr 

Thr 

Leu 

2100 

Pro 

Val 

Asn 

Val 

Ala 

2105 

Phe 

Glu 

Leu 

Trp 

Ala 

2110 

Lys 

Arg 

Asn 

Ile 

Lys 

2115 

Pro 

Val 

Pro 

Glu 

Ile 

2120 

Lys 

Ile 

Leu 

Asn 

Asn 

2125 

Leu 

Gly Val 

Asp 

Ile 

2130 

Ala 

Ala 

Asn 

Thr 

Val 

2135 

Ile 

Trp 

Asp 

Tyr 

Lys 

2140 

Arg 

Glu 

Ala 

Pro 

Ala 

2145 

His 

Val 

Ser 

Thr 

Ile 

2150 

Gly 

Val 

Cys 

Thr 

Met 

2155 

Thr 

Asp 

Ile 

Ala 

Lys 

2160 

Lys 

Pro 

Thr 

Glu 

Ser 

2165 

Ala 

Cys 

Ser 

Ser 

Leu 

2170 

Thr 

Val 

Leu 

Phe 

Asp 

2175 

Gly 

Arg 

Val 

Glu 

Gly 

2180 

Gin 

Val 

Asp 

Leu 

Phe 

2185 

Arg 

Asn 

Ala 

Arg 

Asn 

2190 

Gly 

Val 

Leu 

Ile 

Thr 

2195 

Glu 

Gly 

Ser 

Val 

Lys 

2200 

Gly 

Leu 

Thr 

Pro 

Ser 

2205 

Lys 

Gly Pro 

Ala 

Gin 

2210 

Ala 

Ser 

Val 

Asn 

Gly 

2215 

Val 

Thr 

Leu 

Ile 

Gly 

2220 

Glu 

Ser 

Val 

Lys 

Thr 

2225 

Gin 

Phe 

Asn 

Tyr 

Phe 

2230 

Lys 

Lys 

Val 

Asp 

Gly 

2235 

Ile 

Ile 

Gin 

Gin 

Leu 

2240 

Pro 

Glu 

Thr 

Tyr 

Phe 

2245 

Thr 

Gin 

Ser 

Arg 

Asp 

2250 

Leu 

Glu 

Asp 

Phe 

Lys 

2255 

Pro 

Arg 

Ser 

Gin 

Met 

2260 

Glu 

Thr 

Asp 

Phe 

Leu 

2265 

Glu 

Leu 

Ala 

Met 

Asp 

2270 

Glu 

Phe 

Ile 

Gin 

Arg 

2275 

Tyr 

Lys 

Leu 

Glu 

Gly 

2280 

Tyr 

Ala 

Phe 

Glu 

His 

2285 

Ile 

Val 

Tyr 

Gly 

Asp 

2290 

Phe 

Ser 

His 

Gly 

Gin 

2295 

Leu 

Gly Gly 

Leu 

His 

2300 

Leu 

Met 

Ile 

Gly 

Leu 

2305 

Ala 

Lys 

Arg 

Ser 

Gin 

2310 

Asp 

Ser 

Pro 

Leu 

Lys 

2315 

Leu 

Glu 

Asp 

Phe 

Ile 

2320 

Pro 

Met 

Asp 

Ser 

Thr 

2325 

Val 

Lys 

Asn 

Tyr 

Phe 

2330 

Ile 

Thr 

Asp 

Ala 

Gin 

2335 

Thr 

Gly 

Ser 

Ser 

Lys 

2340 

Cys 

Val 

Cys 

Ser 

Val 

2345 

Ile 

Asp 

Leu 

Leu 

Leu 

2350 

Asp 

Asp 

Phe 

Val 

Glu 

2355 

Ile 

Ile 

Lys 
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Ser 

Gin 

2360 

Asp 

Leu 

Ser 

Val 

Ile 

2365 

Ser 

Lys 

Val 

Val 

Lys 

2370 

Val 

Thr 

Ile 

Asp 

Tyr 

2375 

Ala 

Glu 

Ile 

Ser 

Phe 

2380 

Met 

Leu 

Trp 

Cys 

Lys 

2385 

Asp 

Gly His 

Val 

Glu 

2390 

Thr 

Phe 

Tyr 

Pro 

Lys 

2395 

Leu 

Gin 

Ala 

Ser 

Gin 

2400 

Ala 

Trp 

Gin 

Pro 

Gly 

2405 

Val 

Ala 

Met 

Pro 

Asn 

2410 

Leu 

Tyr 

Lys 

Met 

Gin 

2415 

Arg 

Met 

Leu 

Leu 

Glu 

2420 

Lys 

Cys 

Asp 

Leu 

Gin 

2425 

Asn 

Tyr 

Gly 

Glu 

Asn 

2430 

Ala 

Val 

Ile 

Pro 

Lys 

2435 

Gly 

Ile 

Met 

Met 

Asn 

2440 

Val 

Ala 

Lys 

Tyr 

Thr 

2445 

Gin 

Leu 

Cys 

Gin 

Tyr 

2450 

Leu 

Asn 

Thr 

Leu 

Thr 

2455 

Leu 

Ala 

Val 

Pro 

Tyr 

2460 

Asn 

Met 

Arg 

Val 

Ile 

2465 

His 

Phe 

Gly 

Ala 

Gly 

2470 

Ser 

Asp 

Lys 

Gly 

Val 

2475 

Ala 

Pro 

Gly 

Thr 

Ala 

2480 

Val 

Leu 

Arg 

Gin 

Trp 

2485 

Leu 

Pro 

Thr 

Gly 

Thr 

2490 

Leu 

Leu 

Val 

Asp 

Ser 

2495 

Asp 

Leu 

Asn 

Asp 

Phe 

2500 

Val 

Ser 

Asp 

Ala 

Asp 

2505 

Ser 

Thr 

Leu 

Ile 

Gly 

2510 

Asp 

Cys 

Ala 

Thr 

Val 

2515 

His 

Thr 

Ala 

Asn 

Lys 

2520 

Trp 

Asp 

Leu 

Ile 

Ile 

2525 

Ser 

Asp 

Met 

Tyr 

Asp 

2530 

Pro 

Arg 

Thr 

Lys 

His 

2535 

Val 

Thr 

Lys 

Glu 

Asn 

2540 

Asp 

Ser 

Lys 

Glu 

Gly 

2545 

Phe 

Phe 

Thr 

Tyr 

Leu 

2550 

Cys 

Gly Phe 

Ile 

Lys 

2555 

Gin 

Lys 

Leu 

Ala 

Leu 

2560 

Gly 

Gly 

Ser 

Ile 

Ala 

2565 

Val 

Lys 

Ile 

Thr 

Glu 

2570 

His 

Ser 

Trp 

Asn 

Ala 

2575 

Asp 

Leu 

Tyr 

Lys 

Leu 

2580 

Met 

Gly His 

Phe 

Ser 

2585 

Trp 

Trp 

Thr 

Ala 

Phe 

2590 

Val 

Thr 

Asn 

Val 

Asn 

2595 

Ala 

Ser 

Ser 

Ser 

Glu 

2600 

Ala 

Phe 

Leu 

Ile 

Gly 

2605 

Ala 

Asn 

Tyr 

Leu 

Gly 

2610 

Lys 

Pro 

Lys 

Glu 

Gin 

2615 

Ile 

Asp 

Gly 

Tyr 

Thr 

2620 

Met 

His 

Ala 

Asn 

Tyr 

2625 

Ile 

Phe 

Trp 

Arg 

Asn 

2630 

Thr 

Asn 

Pro 

Ile 

Gin 

2635 

Leu 

Ser 

Ser 

Tyr 

Ser 

2640 

Leu 

Phe 

Asp 

Met 

Ser 

2645 

Lys 

Phe 

Pro 

Leu 

Lys 

2650 

Leu 

Arg 

Gly 

Thr 

Ala 

2655 

Val 

Met 

Ser 

Leu 

Lys 

2660 

Glu 

Asn 

Gin 

Ile 

Asn 

2665 

Asp 

Met 

Ile 

Tyr 

Ser 

2670 

Leu 

Leu 

Glu 

Lys 

Gly 

2675 

Arg 

Leu 

Ile 

Ile 

Arg 

2680 

Glu 

Asn 

Asn 

Arg 

Val 

2685 

Val 

Val 

Ser 

Ser 

Asp 

2690 

Ile 

Leu 

Val 

Asn 

Asn 

2695 










<210> SEQ ID NO 4 

<211> LENGTH: 1255 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 

<400> SEQUENCE: 4 

Met Phe Ile Phe Leu Leu Phe Leu Thr Leu Thr Ser Gly Ser Asp Leu 




105 


1 




5 





10 

Asp 

Arg 

Cys 

Thr 

Thr 

Phe 

Asp 

Asp 

Val 

Gin 




20 





25 


His 

Thr 

Ser 

Ser 

Met 

Arg 

Gly 

Val 

Tyr 

Tyr 



35 





40 



Ser 

Asp 

Thr 

Leu 

Tyr 

Leu 

Thr 

Gin 

Asp 

Leu 


50 





55 




Asn 

Val 

Thr 

Gly 

Phe 

His 

Thr 

Ile 

Asn 

His 

65 





70 





Ile 

Pro 

Phe 

Lys 

Asp 

Gly 

Ile 

Tyr 

Phe 

Ala 





85 





90 

Val 

Val 

Arg 

Gly 

Trp 

Val 

Phe 

Gly 

Ser 

Thr 




100 





105 


Ser 

Val 

Ile 

Ile 

Ile 

Asn 

Asn 

Ser 

Thr 

Asn 



115 





120 



Asn 

Phe 

Glu 

Leu 

Cys 

Asp 

Asn 

Pro 

Phe 

Phe 


130 





135 




Gly Thr 

Gin 

Thr 

His 

Thr 

Met 

Ile 

Phe 

Asp 

145 





150 





Phe 

Glu 

Tyr 

Ile 

Ser 

Asp 

Ala 

Phe 

Ser 

Leu 





165 





170 

Gly Asn 

Phe 

Lys 

His 

Leu 

Arg 

Glu 

Phe 

Val 




180 





185 


Phe 

Leu 

Tyr 

Val 

Tyr 

Lys 

Gly 

Tyr 

Gin 

Pro 



195 





200 



Leu 

Pro 

Ser 

Gly 

Phe 

Asn 

Thr 

Leu 

Lys 

Pro 


210 





215 




Gly 

Ile 

Asn 

Ile 

Thr 

Asn 

Phe 

Arg 

Ala 

Ile 

225 





230 





Ala 

Gin 

Asp 

Ile 

Trp 

Gly 

Thr 

Ser 

Ala 

Ala 





245 





250 

Leu 

Lys 

Pro 

Thr 

Thr 

Phe 

Met 

Leu 

Lys 

Tyr 




260 





265 


Thr 

Asp 

Ala 

Val 

Asp 

Cys 

Ser 

Gin 

Asn 

Pro 



275 





280 



Ser 

Val 

Lys 

Ser 

Phe 

Glu 

Ile 

Asp 

Lys 

Gly 


290 





295 




Phe 

Arg 

Val 

Val 

Pro 

Ser 

Gly 

Asp 

Val 

Val 

305 





310 





Asn 

Leu 

Cys 

Pro 

Phe 

Gly 

Glu 

Val 

Phe 

Asn 





325 





330 

Val 

Tyr 

Ala 

Trp 

Glu 

Arg 

Lys 

Lys 

Ile 

Ser 




340 





345 


Ser 

Val 

Leu 

Tyr 

Asn 

Ser 

Thr 

Phe 

Phe 

Ser 



355 





360 



Val 

Ser 

Ala 

Thr 

Lys 

Leu 

Asn 

Asp 

Leu 

Cys 


370 





375 




Asp 

Ser 

Phe 

Val 

Val 

Lys 

Gly 

Asp 

Asp 

Val 

385 





390 





Gin 

Thr 

Gly 

Val 

Ile 

Ala 

Asp 

Tyr 

Asn 

Tyr 





405 





410 

Met 

Gly 

Cys 

Val 

Leu 

Ala 

Trp 

Asn 

Thr 

Arg 
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15 


Ala 

Pro 

Asn 

Tyr 

30 

Thr 

Gin 

Pro 

Asp 

Glu 

45 

Ile 

Phe 

Arg 

Phe 

Leu 

60 

Pro 

Phe 

Tyr 

Ser 

Thr 

75 

Phe 

Gly 

Asn 

Pro 

Val 

80 

Ala 

Thr 

Glu 

Lys 

Ser 

95 

Asn 

Met 

Asn 

Asn 

Lys 

110 

Ser 

Gin 

Val 

Val 

Ile 

125 

Arg 

Ala 

Cys 

Ala 

Val 

140 

Ser 

Lys 

Pro 

Met 

Asn 

155 

Ala 

Phe 

Asn 

Cys 

Thr 

160 

Asp 

Val 

Ser 

Glu 

Lys 

175 

Ser 

Phe 

Lys 

Asn 

Lys 

190 

Asp 

Gly 

Ile 

Asp 

Val 

205 

Val 

Arg 

Asp 

Ile 

Phe 

220 

Lys 

Leu 

Pro 

Leu 

Leu 

235 

Thr 

Ala 

Phe 

Ser 

Pro 

240 

Ala 

Tyr 

Phe 

Val 

Gly 

255 

Tyr 

Asp 

Glu 

Asn 

Gly 

270 

Thr 

Ile 

Leu 

Ala 

Glu 

285 

Leu 

Lys 

Cys 

Ile 

Tyr 

300 

Gin 

Thr 

Ser 

Asn 

Arg 

315 

Phe 

Pro 

Asn 

Ile 

Thr 

320 

Ala 

Thr 

Lys 

Phe 

Pro 

335 

Ser 

Asn 

Cys 

Val 

Ala 

350 

Asp 

Tyr 

Thr 

Phe 

Lys 

365 

Cys 

Tyr 

Gly 

Phe 

Ser 

380 

Asn 

Val 

Tyr 

Ala 

Arg 

395 

Gin 

Ile 

Ala 

Pro 

Gly 

400 

Lys 

Leu 

Pro 

Asp 

Asp 

415 

Phe 

Asn 

Ile 

Asp 

Ala 

430 

Thr 

Ser 




107 


Thr 

Gly 

Asn 

Tyr 

Asn 

Tyr 

Lys 

Tyr 

Arg 

Tyr 

Leu 



435 





440 




Arg 

Pro 

Phe 

Glu 

Arg 

Asp 

Ile 

Ser 

Asn 

Val 

Pro 


450 





455 





Lys 

Pro 

Cys 

Thr 

Pro 

Pro 

Ala 

Leu 

Asn 

Cys 

Tyr 

465 





470 





475 

Tyr 

Gly 

Phe 

Tyr 

Thr 

Thr 

Thr 

Gly 

Ile 

Gly 

Tyr 





485 





490 


Val 

Val 

Leu 

Ser 

Phe 

Glu 

Leu 

Leu 

Asn 

Ala 

Pro 




500 





505 



Pro 

Lys 

Leu 

Ser 

Thr 

Asp 

Leu 

Ile 

Lys 

Asn 

Gin 



515 





520 




Phe 

Asn 

Gly 

Leu 

Thr 

Gly 

Thr 

Gly 

Val 

Leu 

Thr 


530 





535 





Phe 

Gin 

Pro 

Phe 

Gin 

Gin 

Phe 

Gly 

Arg 

Asp 

Val 

545 





550 





555 

Ser 

Val 

Arg 

Asp 

Pro 

Lys 

Thr 

Ser 

Glu 

Ile 

Leu 





565 





570 


Ser 

Phe 

Gly 

Gly 

Val 

Ser 

Val 

Ile 

Thr 

Pro 

Gly 




580 





585 



Glu 

Val 

Ala 

Val 

Leu 

Tyr 

Gin 

Asp 

Val 

Asn 

Cys 



595 





600 




Ala 

Ile 

His 

Ala 

Asp 

Gin 

Leu 

Thr 

Pro 

Ala 

Trp 


610 





615 





Gly Asn 

Asn 

Val 

Phe 

Gin 

Thr 

Gin 

Ala 

Gly 

Cys 

625 





630 





635 

His 

Val 

Asp 

Thr 

Ser 

Tyr 

Glu 

Cys 

Asp 

Ile 

Pro 





645 





650 


Cys 

Ala 

Ser 

Tyr 

His 

Thr 

Val 

Ser 

Leu 

Leu 

Arg 




660 





665 



Ser 

Ile 

Val 

Ala 

Tyr 

Thr 

Met 

Ser 

Leu 

Gly 

Ala 



675 





680 




Tyr 

Ser 

Asn 

Asn 

Thr 

Ile 

Ala 

Ile 

Pro 

Thr 

Asn 


690 





695 





Thr 

Thr 

Glu 

Val 

Met 

Pro 

Val 

Ser 

Met 

Ala 

Lys 

705 





710 





715 

Asn 

Met 

Tyr 

Ile 

Cys 

Gly 

Asp 

Ser 

Thr 

Glu 

Cys 





725 





730 


Gin 

Tyr 

Gly 

Ser 

Phe 

Cys 

Thr 

Gin 

Leu 

Asn 

Arg 




740 





745 



Ala 

Ala 

Glu 

Gin 

Asp 

Arg 

Asn 

Thr 

Arg 

Glu 

Val 



755 





760 




Gin 

Met 

Tyr 

Lys 

Thr 

Pro 

Thr 

Leu 

Lys 

Tyr 

Phe 


770 





775 





Ser 

Gin 

Ile 

Leu 

Pro 

Asp 

Pro 

Leu 

Lys 

Pro 

Thr 

785 





790 





795 

Glu 

Asp 

Leu 

Leu 

Phe 

Asn 

Lys 

Val 

Thr 

Leu 

Ala 





805 





810 


Lys 

Gin 

Tyr 

Gly 

Glu 

Cys 

Leu 

Gly 

Asp 

Ile 

Asn 




820 





825 



Cys 

Ala 

Gin 

Lys 

Phe 

Asn 

Gly 

Leu 

Thr 

Val 

Leu 



835 





840 
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Arg 

His 

445 

Gly 

Lys 

Leu 

Phe 

460 

Ser 

Pro 

Asp 

Gly 

Trp 

Pro 

Leu 

Asn 

Asp 

480 

Gin 

Pro 

Tyr 

Arg 

495 

Val 

Ala 

Thr 

Val 

510 

Cys 

Gly 

Cys 

Val 

525 

Asn 

Phe 

Asn 

Pro 

540 

Ser 

Ser 

Lys 

Arg 

Ser 

Asp 

Phe 

Thr 

Asp 

560 

Asp 

Ile 

Ser 

Pro 

575 

Cys 

Thr 

Asn 

Ala 

590 

Ser 

Ser 

Thr 

Asp 

605 

Val 

Ser 

Thr 

Arg 

620 

Ile 

Tyr 

Ser 

Thr 

Leu 

Ile 

Gly 

Ala 

Glu 

640 

Ile 

Gly 

Ala 

Gly 

655 

Ile 

Ser 

Thr 

Ser 

670 

Gin 

Lys 

Asp 

Ser 

685 

Ser 

Ile 

Ala 

Phe 

700 

Ser 

Ile 

Ser 

Ile 

Thr 

Ser 

Val 

Asp 

Cys 

720 

Ala 

Asn 

Leu 

Leu 

735 

Leu 

Ala 

Leu 

Ser 

750 

Gly 

Ile 

Phe 

Ala 

765 

Gin 

Val 

Lys 

Gly 

780 

Gly 

Phe 

Asn 

Phe 

Lys 

Arg 

Ser 

Phe 

Ile 

800 

Asp 

Ala 

Gly 

Phe 

815 

Met 

Ala 

Arg 

Asp 

830 

Leu 

Ile 

Pro 

Pro 

845 

Leu 

Leu 

Thr 
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Asp 

Asp 

Met 

Ile 

Ala 

Ala 

Tyr 

Thr 

Ala 

Ala 

Leu 

Val 

Ser 

Gly 

Thr 

Ala 


850 





855 





860 





Thr 

Ala 

Gly 

Trp 

Thr 

Phe 

Gly 

Ala 

Gly 

Ala 

Ala 

Leu 

Gin 

Ile 

Pro 

Phe 

865 





870 





875 





880 

Ala 

Met 

Gin 

Met 

Ala 

Tyr 

Arg 

Phe 

Asn 

Gly 

Ile 

Gly 

Val 

Thr 

Gin 

Asn 





885 





890 





895 


Val 

Leu 

Tyr 

Glu 

Asn 

Gin 

Lys 

Gin 

Ile 

Ala 

Asn 

Gin 

Phe 

Asn 

Lys 

Ala 




900 





905 





910 



Ile 

Ser 

Gin 

Ile 

Gin 

Glu 

Ser 

Leu 

Thr 

Thr 

Thr 

Ser 

Thr 

Ala 

Leu 

Gly 



915 





920 





925 




Lys 

Leu 

Gin 

Asp 

Val 

Val 

Asn 

Gin 

Asn 

Ala 

Gin 

Ala 

Leu 

Asn 

Thr 

Leu 


930 





935 





940 





Val 

Lys 

Gin 

Leu 

Ser 

Ser 

Asn 

Phe 

Gly 

Ala 

Ile 

Ser 

Ser 

Val 

Leu 

Asn 

945 





950 





955 





960 

Asp 

Ile 

Leu 

Ser 

Arg 

Leu 

Asp 

Lys 

Val 

Glu 

Ala 

Glu 

Val 

Gin 

Ile 

Asp 





965 





970 





975 


Arg 

Leu 

Ile 

Thr 

Gly 

Arg 

Leu 

Gin 

Ser 

Leu 

Gin 

Thr 

Tyr 

Val 

Thr 

Gin 




980 





985 





990 




Gin Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala Ala 
995 1000 1005 


Thr 

Lys 

1010 

Met 

Ser 

Glu 

Cys 

Val 

1015 

Leu 

Gly Gin 

Ser 

Lys 

1020 

Arg 

Val 

Asp 

Phe 

Cys 

1025 

Gly 

Lys 

Gly 

Tyr 

His 

1030 

Leu 

Met 

Ser 

Phe 

Pro 

1035 

Gin 

Ala 

Ala 

Pro 

His 

1040 

Gly 

Val 

Val 

Phe 

Leu 

1045 

His 

Val 

Thr 

Tyr 

Val 

1050 

Pro 

Ser 

Gin 

Glu 

Arg 

1055 

Asn 

Phe 

Thr 

Thr 

Ala 

1060 

Pro 

Ala 

Ile 

Cys 

His 

1065 

Glu 

Gly Lys 

Ala 

Tyr 

1070 

Phe 

Pro 

Arg 

Glu 

Gly 

1075 

Val 

Phe 

Val 

Phe 

Asn 

1080 

Gly 

Thr 

Ser 

Trp 

Phe 

1085 

Ile 

Thr 

Gin 

Arg 

Asn 

1090 

Phe 

Phe 

Ser 

Pro 

Gin 

1095 

Ile 

Ile 

Thr 

Thr 

Asp 

1100 

Asn 

Thr 

Phe 

Val 

Ser 

1105 

Gly 

Asn 

Cys 

Asp 

Val 

1110 

Val 

Ile Gly 

Ile 

Ile 

1115 

Asn 

Asn 

Thr 

Val 

Tyr 

1120 

Asp 

Pro 

Leu 

Gin 

Pro 

1125 

Glu 

Leu 

Asp 

Ser 

Phe 

1130 

Lys 

Glu 

Glu 

Leu 

Asp 

1135 

Lys 

Tyr 

Phe 

Lys 

Asn 

1140 

His 

Thr 

Ser 

Pro 

Asp 

1145 

Val 

Asp 

Leu 

Gly 

Asp 

1150 

Ile 

Ser 

Gly 

Ile 

Asn 

1155 

Ala 

Ser 

Val 

Val 

Asn 

1160 

Ile 

Gin 

Lys 

Glu 

Ile 

1165 

Asp 

Arg 

Leu 

Asn 

Glu 

1170 

Val 

Ala 

Lys 

Asn 

Leu 

1175 

Asn 

Glu 

Ser 

Leu 

Ile 

1180 

Asp 

Leu 

Gin 

Glu 

Leu 

1185 

Gly 

Lys 

Tyr 

Glu 

Gin 

1190 

Tyr 

Ile 

Lys 

Trp 

Pro 

1195 

Trp 

Tyr 

Val 

Trp 

Leu 

1200 

Gly 

Phe 

Ile 

Ala 

Gly 

1205 

Leu 

Ile 

Ala 

Ile 

Val 

1210 

Met 

Val 

Thr 

Ile 

Leu 

1215 

Leu 

Cys 

Cys 

Met 

Thr 

1220 

Ser 

Cys 

Cys 

Ser 

Cys 

1225 

Leu 

Lys 

Gly 

Ala 

Cys 

1230 

Ser 

Cys 

Gly 

Ser 

Cys 

1235 

Cys 

Lys 

Phe 

Asp 

Glu 

1240 

Asp 

Asp 

Ser 

Glu 

Pro 

1245 

Val 

Leu 

Lys 

Gly 

Val 

Lys 

Leu 

His 

Tyr 

Thr 
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1250 1255 


<210> SEQ ID NO 5 

<211> LENGTH: 274 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 5 


Met 

Asp 

Leu 

Phe 

Met 

Arg 

Phe 

Phe 

Thr 

Leu 

Gly 

Ser 

Ile 

Thr 

Ala 

Gin 

1 




5 





10 





15 


Pro 

Val 

Lys 

Ile 

Asp 

Asn 

Ala 

Ser 

Pro 

Ala 

Ser 

Thr 

Val 

His 

Ala 

Thr 




20 





25 





30 



Ala 

Thr 

Ile 

Pro 

Leu 

Gin 

Ala 

Ser 

Leu 

Pro 

Phe 

Gly Trp 

Leu 

Val 

Ile 



35 





40 





45 




Gly Val 

Ala 

Phe 

Leu 

Ala 

Val 

Phe 

Gin 

Ser 

Ala 

Thr 

Lys 

Ile 

Ile 

Ala 


50 





55 





60 





Leu 

Asn 

Lys 

Arg 

Trp 

Gin 

Leu 

Ala 

Leu 

Tyr 

Lys 

Gly Phe 

Gin 

Phe 

Ile 

65 





70 





75 





80 

Cys 

Asn 

Leu 

Leu 

Leu 

Leu 

Phe 

Val 

Thr 

Ile 

Tyr 

Ser 

His 

Leu 

Leu 

Leu 





85 





90 





95 


Val 

Ala 

Ala 

Gly Met 

Glu 

Ala 

Gin 

Phe 

Leu 

Tyr 

Leu 

Tyr 

Ala 

Leu 

Ile 




100 





105 





110 



Tyr 

Phe 

Leu 

Gin 

Cys 

Ile 

Asn 

Ala 

Cys 

Arg 

Ile 

Ile 

Met 

Arg 

Cys 

Trp 



115 





120 





125 




Leu 

Cys 

Trp 

Lys 

Cys 

Lys 

Ser 

Lys 

Asn 

Pro 

Leu 

Leu 

Tyr 

Asp 

Ala 

Asn 


130 





135 





140 





Tyr 

Phe 

Val 

Cys 

Trp 

His 

Thr 

His 

Asn 

Tyr Asp 

Tyr 

Cys 

Ile 

Pro 

Tyr 

145 





150 





155 





160 

Asn 

Ser 

Val 

Thr 

Asp 

Thr 

Ile 

Val 

Val 

Thr 

Glu Gly Asp 

Gly 

Ile 

Ser 





165 





170 





175 


Thr 

Pro 

Lys 

Leu 

Lys 

Glu 

Asp 

Tyr 

Gin 

Ile Gly Gly Tyr 

Ser 

Glu 

Asp 




180 





185 





190 



Arg 

His 

Ser 

Gly Val 

Lys 

Asp 

Tyr 

Val 

Val 

Val 

His 

Gly 

Tyr 

Phe 

Thr 



195 





200 





205 




Glu 

Val 

Tyr 

Tyr 

Gin 

Leu 

Glu 

Ser 

Thr 

Gin 

Ile 

Thr 

Thr 

Asp 

Thr Gly 


210 





215 





220 





Ile 

Glu 

Asn 

Ala 

Thr 

Phe 

Phe 

Ile 

Phe 

Asn 

Lys 

Leu 

Val 

Lys 

Asp 

Pro 

225 





230 





235 





240 

Pro 

Asn 

Val 

Gin 

Ile 

His 

Thr 

Ile 

Asp 

Gly 

Ser 

Ser 

Gly 

Val 

Ala 

Asn 





245 





250 





255 


Pro 

Ala 

Met 

Asp 

Pro 

Ile 

Tyr 

Asp 

Glu 

Pro 

Thr 

Thr 

Thr 

Thr 

Ser 

Val 




260 





265 





270 



Pro 

Leu 















<210> SEQ ID 

' NO 

6 












<2 11> LENGTH 

!: 154 












<212> TYPE: 

PRT 













<213> ORGANISM: 

Coronavirus 










<400> SEQUENCE: 

6 












Met 

Met 

Pro 

Thr 

Thr 

Leu 

Phe 

Ala 

Gly 

Thr 

His 

Ile 

Thr 

Met 

Thr 

Thr 

1 




5 





10 





15 


Val 

Tyr 

His 

Ile 

Thr 

Val 

Ser 

Gin 

Ile 

Gin 

Leu 

Ser 

Leu 

Leu 

Lys 

Val 




20 





25 





30 



Thr 

Ala 

Phe 

Gin 

His 

Gin 

Asn 

Ser 

Lys 

Lys 

Thr 

Thr 

Lys 

Leu 

Val 

Val 



35 





40 





45 
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Ile 

Leu 

Arg 

Ile 

Gly 

Thr 

Gin 

Val 

Leu 

Lys 

Thr 

Met 

Ser 

Leu 

Tyr 

Met 


50 





55 





60 





Ala 

Ile 

Ser 

Pro 

Lys 

Phe 

Thr 

Thr 

Ser 

Leu 

Ser 

Leu 

His 

Lys 

Leu 

Leu 

65 





70 





75 





80 

Gin 

Thr 

Leu 

Val 

Leu 

Lys 

Met 

Leu 

His 

Ser 

Ser 

Ser 

Leu 

Thr 

Ser 

Leu 





85 





90 





95 


Leu 

Lys 

Thr 

His 

Arg 

Met 

Cys 

Lys 

Tyr 

Thr 

Gin 

Ser 

Thr 

Ala 

Leu 

Gin 




100 





105 





110 



Glu 

Leu 

Leu 

Ile 

Gin 

Gin 

Trp 

Ile 

Gin 

Phe 

Met 

Met 

Ser 

Arg 

Arg 

Arg 



115 





120 





125 




Leu 

Leu 

Ala 

Cys 

Leu 

Cys 

Lys 

His 

Lys 

Lys 

Val 

Ser 

Thr 

Asn 

Leu 

Cys 


130 





135 





140 





Thr 

His 

Ser 

Phe 

Arg 

Lys 

Lys 

Gin 

Val 

Arg 







145 





150 












<210> SEQ ID NO 7 

<211> LENGTH: 76 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 7 


Met 

Tyr 

Ser 

Phe 

Val 

Ser 

Glu 

Glu 

Thr Gly 

Thr 

Leu 

Ile 

Val 

Asn 

Ser 

1 




5 





10 





15 


Val 

Leu 

Leu 

Phe 

Leu 

Ala 

Phe 

Val 

Val 

Phe 

Leu 

Leu 

Val 

Thr 

Leu 

Ala 




20 





25 





30 



Ile 

Leu 

Thr 

Ala 

Leu 

Arg 

Leu 

Cys 

Ala 

Tyr 

Cys 

Cys 

Asn 

Ile 

Val 

Asn 



35 





40 





45 




Val 

Ser 

Leu 

Val 

Lys 

Pro 

Thr 

Val 

Tyr 

Val 

Tyr 

Ser 

Arg 

Val 

Lys 

Asn 


50 





55 





60 





Leu 

Asn 

Ser 

Ser 

Glu 

Gly 

Val 

Pro 

Asp 

Leu 

Leu 

Val 





65 





70 





75 







<210> SEQ ID NO 8 

<211> LENGTH: 221 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 8 


Met 

Ala 

Asp 

Asn 

Gly 

Thr 

Ile 

Thr 

Val 

Glu 

Glu 

Leu 

Lys 

Gin 

Leu 

Leu 

1 




5 





10 





15 


Glu 

Gin 

Trp 

Asn 

Leu 

Val 

Ile 

Gly 

Phe 

Leu 

Phe 

Leu 

Ala 

Trp 

Ile 

Met 




20 





25 





30 



Leu 

Leu 

Gin 

Phe 

Ala 

Tyr 

Ser 

Asn 

Arg 

Asn 

Arg 

Phe 

Leu 

Tyr 

Ile 

Ile 



35 





40 





45 




Lys 

Leu 

Val 

Phe 

Leu 

Trp 

Leu 

Leu 

Trp 

Pro 

Val 

Thr 

Leu 

Ala 

Cys 

Phe 


50 





55 





60 





Val 

Leu 

Ala 

Ala 

Val 

Tyr 

Arg 

Ile 

Asn 

Trp 

Val 

Thr 

Gly Gly 

Ile 

Ala 

65 





70 





75 





80 

0) 
i— i 
H 

Ala 

Met 

Ala 

Cys 

Ile 

Val 

Gly 

Leu 

Met 

Trp 

Leu 

Ser 

Tyr 

Phe 

Val 





85 





90 





95 


Ala 

Ser 

Phe 

Arg 

Leu 

Phe 

Ala 

Arg 

Thr 

Arg 

Ser 

Met 

Trp 

Ser 

Phe 

Asn 




100 





105 





110 



Pro 

Glu 

Thr 

Asn 

Ile 

Leu 

Leu 

Asn 

Val 

Pro 

Leu 

Arg 

Gly Thr 

Ile 

Val 



115 





120 





125 





Thr Arg Pro Leu Met Glu Ser Glu Leu Val Ile Gly Ala Val Ile Ile 
130 135 140 
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Arg 

Gly 

His 

Leu 

Arg 

Met 

Ala 

Gly 

His 

Pro 

Leu 

Gly Arg 

Cys 

Asp 

Ile 

145 





150 





155 





160 

Lys 

Asp 

Leu 

Pro 

Lys 

Glu 

Ile 

Thr 

Val 

Ala 

Thr 

Ser 

Arg 

Thr 

Leu 

Ser 





165 





170 





175 


Tyr 

Tyr 

Lys 

Leu 

Gly 

Ala 

Ser 

Gin 

Arg 

Val 

Gly Thr 

Asp 

Ser 

Gly Phe 




180 





185 





190 



Ala 

Ala 

Tyr 

Asn 

Arg 

Tyr 

Arg 

Ile 

Gly 

Asn 

Tyr 

Lys 

Leu 

Asn 

Thr 

Asp 



195 





200 





205 




His 

Ala 

Gly 

Ser 

Asn 

Asp 

Asn 

Ile 

Ala 

Leu 

Leu 

Val 

Gin 





210 215 220 


<210> SEQ ID NO 9 

<211> LENGTH: 63 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 9 


Met 

Phe 

His 

Leu 

Val 

Asp 

Phe 

Gin 

Val 

Thr 

Ile 

Ala 

Glu 

Ile 

Leu 

Ile 

1 




5 





10 





15 


Ile 

Ile 

Met 

Arg 

Thr 

Phe 

Arg 

Ile 

Ala 

Ile 

Trp 

Asn 

Leu 

Asp 

Val 

Ile 




20 





25 





30 



Ile 

Ser 

Ser 

Ile 

Val 

Arg 

Gin 

Leu 

Phe 

Lys 

Pro 

Leu 

Thr 

Lys 

Lys 

Asn 



35 





40 





45 




Tyr 

Ser 

Glu 

Leu 

Asp 

Asp 

Glu 

Glu 

Pro 

Met 

Glu 

Leu 

Asp 

Tyr 

Pro 



50 





55 





60 






<210> SEQ ID NO 10 

<211> LENGTH: 122 

<212> TYPE: PRT 

<213> ORGANISM: Coronavirus 


<400> SEQUENCE: 10 


Met 

Lys 

Ile 

Ile 

Leu 

Phe 

Leu 

Thr 

Leu 

Ile 

Val 

Phe 

Thr 

Ser 

Cys 

Glu 

1 




5 





10 





15 


Leu 

Tyr 

His 

Tyr 

Gin 

Glu 

Cys 

Val 

Arg 

Gly Thr 

Thr 

Val 

Leu 

Leu 

Lys 




20 





25 





30 



Glu 

Pro 

Cys 

Pro 

Ser 

Gly 

Thr 

Tyr 

Glu 

Gly 

Asn 

Ser 

Pro 

Phe 

His 

Pro 



35 





40 





45 




Leu 

Ala 

Asp 

Asn 

Lys 

Phe 

Ala 

Leu 

Thr 

Cys 

Thr 

Ser 

Thr 

His 

Phe 

Ala 


50 





55 





60 





Phe 

Ala 

Cys 

Ala 

Asp 

Gly 

Thr 

Arg 

His 

Thr 

Tyr 

Gin 

Leu 

Arg 

Ala 

Arg 

65 





70 





75 





80 

Ser 

Val 

Ser 

Pro 

Lys 

Leu 

Phe 

Ile 

Arg 

Gin 

Glu 

Glu 

Val 

Gin 

Gin 

Glu 





85 





90 





95 


Leu 

Tyr 

Ser 

Pro 

Leu 

Phe 

Leu 

Ile 

Val 

Ala 

Ala 

Leu 

Val 

Phe 

Leu 

Ile 




100 





105 





110 



Leu 

Cys 

Phe 

Thr 

Ile 

Lys 

Arg 

Lys 

Thr 

Glu 









115 





120 










A 

K) 

O 

V 

SEQ ID NO 

11 

<211> 

LENGTH: 8 4 


<212> 

TYPE: PRT 


<213> 

ORGANISM: 

Coronavirus 

A 

o 

o 

V 

SEQUENCE: 

11 


Met Cys Leu Lys Ile Leu Val Arg Tyr Asn Thr Arg Gly Asn Thr Tyr 
15 10 15 
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Ser 

Thr 

Ala 

Trp 

Leu 

Cys 

Ala 

Leu 

Gly 

Lys 

Val 

Leu 

Pro 

Phe 

His 

Arg 




20 





25 





30 



Trp 

His 

Thr 

Met 

Val 

Gin 

Thr 

Cys 

Thr 

Pro 

Asn 

Val 

Thr 

Ile 

Asn 

Cys 



35 





40 





45 




Gin 

Asp 

Pro 

Ala 

Gly 

Gly 

Ala 

Leu 

Ile 

Ala 

Arg 

Cys 

Trp 

Tyr 

Leu 

His 


50 





55 





60 





Glu 

Gly 

His 

Gin 

Thr 

Ala 

Ala 

Phe 

Arg 

Asp 

Val 

Leu 

Val 

Val 

Leu 

Asn 

65 





70 





75 





80 

Lys 

Arg 

Thr 

Asn 













<210> SEQ IE 

i NO 

12 












<211> LENGTH 

;: 422 












<212> TYPE: 

PRT 













<213> ORGANISM: 

Coronavirus 










<400> SEQUENCE: 

12 












Met 

Ser 

Asp 

Asn 

Gly 

Pro 

Gin 

Ser 

Asn 

Gin 

Arg 

Ser 

Ala 

Pro 

Arg 

Ile 

1 




5 





10 





15 


Thr 

Phe 

Gly 

Gly 

Pro 

Thr 

Asp 

Ser 

Thr 

Asp 

Asn 

Asn 

Gin 

Asn 

Gly Gly 




20 





25 





30 



Arg 

Asn 

Gly 

Ala 

Arg 

Pro 

Lys 

Gin 

Arg 

Arg 

Pro 

Gin 

Gly 

Leu 

Pro 

Asn 



35 





40 





45 




Asn 

Thr 

Ala 

Ser 

Trp 

Phe 

Thr 

Ala 

Leu 

Thr 

Gin 

His 

Gly 

Lys 

Glu 

Glu 


50 





55 





60 





Leu 

Arg 

Phe 

Pro 

Arg 

Gly 

Gin 

Gly 

Val 

Pro 

Ile 

Asn 

Thr 

Asn 

Ser 

Gly 

65 





70 





75 





80 

Pro 

Asp 

Asp 

Gin 

Ile 

Gly 

Tyr 

Tyr 

Arg 

Arg 

Ala 

Thr 

Arg 

Arg 

Val 

Arg 





85 





90 





95 


Gly Gly 

Asp 

Gly 

Lys 

Met 

Lys 

Glu 

Leu 

Ser 

Pro 

Arg 

Trp 

Tyr 

Phe 

Tyr 




100 





105 





110 



Tyr 

Leu 

Gly 

Thr 

Gly 

Pro 

Glu 

Ala 

Ser 

Leu 

Pro 

Tyr 

Gly 

Ala 

Asn 

Lys 



115 





120 





125 




Glu 

Gly 

Ile 

Val 

Trp 

Val 

Ala 

Thr 

Glu 

Gly 

Ala 

Leu 

Asn 

Thr 

Pro 

Lys 


130 





135 





140 





Asp 

His 

Ile 

Gly 

Thr 

Arg 

Asn 

Pro 

Asn 

Asn 

Asn 

Ala 

Ala 

Thr 

Val 

Leu 

145 





150 





155 





160 

Gin 

Leu 

Pro 

Gin 

Gly 

Thr 

Thr 

Leu 

Pro 

Lys 

Gly 

Phe 

Tyr 

Ala 

Glu Gly 





165 





170 





175 


Ser 

Arg 

Gly 

Gly 

Ser 

Gin 

Ala 

Ser 

Ser 

Arg 

Ser 

Ser 

Ser 

Arg 

Ser 

Arg 




180 





185 





190 



Gly 

Asn 

Ser 

Arg 

Asn 

Ser 

Thr 

Pro 

Gly 

Ser 

Ser 

Arg 

Gly 

Asn 

Ser 

Pro 



195 





200 





205 




Ala 

Arg 

Met 

Ala 

Ser 

Gly 

Gly 

Gly 

Glu 

Thr 

Ala 

Leu 

Ala 

Leu 

Leu 

Leu 


210 





215 





220 





Leu 

Asp 

Arg 

Leu 

Asn 

Gin 

Leu 

Glu 

Ser 

Lys 

Val 

Ser 

Gly 

Lys 

Gly Gin 

225 





230 





235 





240 

Gin 

Gin 

Gin 

Gly 

Gin 

Thr 

Val 

Thr 

Lys 

Lys 

Ser 

Ala 

Ala 

Glu 

Ala 

Ser 





245 





250 





255 


Lys 

Lys 

Pro 

Arg 

Gin 

Lys 

Arg 

Thr 

Ala 

Thr 

Lys 

Gin 

Tyr 

Asn 

Val 

Thr 




260 





265 





270 



Gin 

Ala 

Phe 

Gly 

Arg 

Arg 

Gly 

Pro 

Glu 

Gin 

Thr 

Gin 

Gly 

Asn 

Phe 

Gly 



275 





280 





285 




Asp 

Gin 

Asp 

Leu 

Ile 

Arg 

Gin 

Gly 

Thr 

Asp 

Tyr 

Lys 

His 

Trp 

Pro 

Gin 


290 





295 





300 
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<1) 

1 — 1 
H 

Ala 

Gin 

Phe 

Ala 

Pro 

Ser 

Ala 

Ser 

Ala 

Phe 

Phe 

Gly Met 

Ser 

Arg 

305 





310 





315 





320 

o 

■— i 
H 

Gly 

Met 

Glu 

Val 

Thr 

Pro 

Ser 

Gly 

Thr 

Trp 

Leu 

Thr 

Tyr 

His 

Gly 





325 





330 





335 


Ala 

Ile 

Lys 

Leu 

Asp 

Asp 

Lys 

Asp 

Pro 

Gin 

Phe 

Lys 

Asp 

Asn 

Val 

Ile 




340 





345 





350 



Leu 

Leu 

Asn 

Lys 

His 

Ile 

Asp 

Ala 

Tyr 

Lys 

Thr 

Phe 

Pro 

Pro 

Thr 

Glu 



355 





360 





365 




Pro 

Lys 

Lys 

Asp 

Lys 

Lys 

Lys 

Lys 

Thr 

Asp 

Glu 

Ala 

Gin 

Pro 

Leu 

Pro 


370 





375 





380 





Gin 

Arg 

Gin 

Lys 

Lys 

Gin 

Pro 

Thr 

Val 

Thr 

Leu 

Leu 

Pro 

Ala 

Ala 

Asp 

385 





390 





395 





400 

Met 

Asp 

Asp 

Phe 

Ser 

Arg 

Gin 

Leu 

Gin 

Asn 

Ser 

Met 

Ser 

Gly Ala 

Ser 





405 





410 





415 


Ala 

Asp 

Ser 

Thr 

Gin 

Ala 












420 


<210> SEQ ID NO 13 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 13 

ctaacatgct taggataatg g 21 


<210> SEQ ID NO 14 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 14 

gcctctcttg ttcttgctcg c 21 


<210> SEQ ID NO 15 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 15 

caggtaagcg taaaactcat c 21 


<210> SEQ ID NO 16 
<211> LENGTH: 22 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 16 

catgtgtggc ggctcactat at 22 


<210> SEQ ID NO 17 
<211> LENGTH: 28 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 
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-continued 


<400> SEQUENCE: 17 

gacactatta gcataagcag ttgtagca 28 


<210> SEQ ID NO 18 
<211> LENGTH: 27 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 18 

ttaaaccagg tggaacatca tccggtg 27 


<210> SEQ ID NO 19 
<211> LENGTH: 22 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 19 

ggagccttga atacacccaa ag 22 


<210> SEQ ID NO 20 
<211> LENGTH: 17 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 20 

gcacggtggc agcattg 17 


<210> SEQ ID NO 21 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 21 

ccacattggc acccgcaatc c 21 


<210> SEQ ID NO 22 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 
<400> SEQUENCE: 22 


caaacattgg ccgcaaatt 


19 


<210> SEQ ID NO 23 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 
<400> SEQUENCE: 23 


caatgcgtga cattccaaag a 


21 


<210> SEQ ID NO 24 
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<211> LENGTH: 26 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 
<400> SEQUENCE: 24 


cacaatttgc tccaagtgcc tctgca 


26 


<210> SEQ ID NO 25 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 25 

gaagtaccat ctggggctga g 21 


<210> SEQ ID NO 26 
<211> LENGTH: 18 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 26 

ccgaagagct acccgacg 18 


<210> SEQ ID NO 27 
<211> LENGTH: 26 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 27 

ctctttcatt ttgccgtcac caccac 26 


<210> SEQ ID NO 28 
<211> LENGTH: 24 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 2 8 

agctctccct agcattattc actg 24 


<210> SEQ ID NO 29 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 29 

caccacattt tcatcgaggc 20 


<210> SEQ ID NO 30 
<211> LENGTH: 22 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 


<400> SEQUENCE: 30 
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taccctcgat cgtactccgc gt 


22 


<210> SEQ ID NO 31 
<211> LENGTH: 23 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 31 

tgtaggcact gattcaggtt ttg 23 


<210> SEQ ID NO 32 
<211> LENGTH: 23 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 32 

cggcgtggtc tgtatttaat tta 23 


<210> SEQ ID NO 33 
<211> LENGTH: 27 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 33 

ctgcatacaa ccgctaccgt attggaa 27 


<210> SEQ ID NO 34 
<211> LENGTH: 24 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 3 4 

gggttgggac tatcctaagt gtga 24 


<210> SEQ ID NO 35 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<220> FEATURE: 

<221> NAME/KEY: misc_feature 
<222> LOCATION: (12).. (12) 

<223> OTHER INFORMATION: "n" equals inosine. 

<400> SEQUENCE: 35 

taacacacaa cnccatcatc a 21 


<210> SEQ ID NO 36 
<211> LENGTH: 19 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 
<400> SEQUENCE: 36 


agatttggac ctgcgagcg 


19 
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<210> SEQ ID NO 37 
<211> LENGTH: 20 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 
<400> SEQUENCE: 37 


gagcggctgt ctccacaagt 


20 


<210> SEQ ID NO 38 
<211> LENGTH: 23 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide. 

<400> SEQUENCE: 38 

ttctgacctg aaggctctgc gcg 23 


We claim: 

1. An isolated nucleic acid molecule consisting of the 
nucleotide sequence as set forth in SEQ ID NO: 1. * * * * * 





